Preface


We are happy to honor three well-known number theorists, Henry 
B. Mann, Arnold E. Ross, and Olga Taussky-Todd. We do this at a time 
when they can look back over many years of honest toil and productive 
work. 

We, their colleagues, pupils, collaborators, and friends find it fitting to 
dedicate to them the fruits of our work so as to pass on to researchers 
coming after us the spirit of patient toil in the service of the queenly science 
which our three honorees implanted in us. 

We would like to thank the publisher for encouraging the publication 
of this special volume. We are also grateful for the overwhelming response 
by the authors upon this happy occasion. 

As a study of the table of contents of this book will suggest to the 
reader, there is an enormous variety and depth to the research efforts of the 
modern number theorist and algebraist. As a first guide to the reader who 
is familiar with the traditional classification of the fields under consideration, 
we would like to group the contents along the following lines. 

Elementary number theory is one of the most ancient sources of number 
theoretical and algebraic speculation which only now seems to be able to 
find its true depth. See for example the contribution of “Erdős” type by
P. Erdős and E. G. Straus. Richard K. Guy’s contribution has a more
experimental flavor. Emma Lehmer follows in the steps of C. F. Gauss 
(1777-1855). An application of Lie algebra is made by J. W. B. Hughes. 

Numbers as objects of statistical analytic number theory investigations 
are dealt with by K. Mahler, by R. L. Graham, H. S. Witsenhausen, and 
J. H. Spencer; also by Wolfgang M. Schmidt as well as by the late 
S. Knapowski and P. Turán.

An application of the theory of modular forms is given by Bram van Asch. 

The combinatorial aspects of number theory and their connection with 
group theory are dealt with in the contribution by John E. Olson and 
Edward T. White; also by Sunder Lal, R. L. McFarland, and R. W. K. Odoni
in response to a challenging problem of H. Mann on difference sets. The
combinatorial properties of finite fields are dealt with by D. K. Ray-Chaudhuri
and N. M. Singhi. An application of such ideas in finite projective geometry 
was made by Jill C. D. S. Yaqub. 

The algebraic theory of quadratic forms and their applications is expounded 
by Warren Wolfe. A novel more abstract treatment of the theory is given 
by Manfred Knebusch. 

The classical arithmetical theory of quadratic forms is dealt with in the 
articles by A. G. Earnest and J. S. Hsia, Robert Gold and Paul Ponomarev, 
and by Marshall Hall, Jr. 

The algebra of finite extensions was enriched by the late Bohuslav Diviš in
reply to a challenging question of A. Schinzel.

The theory of algebraic number fields and their arithmetics is richly 
represented by the contributions of D. S. Dummit and H. Kisilevsky, 
A. Fröhlich, Dennis A. Garbanati, Basil Gordon and Murray Schacher,
Gary Queen, Ross Schipper, William Yslas Velez, by Sunder Lal, R. L.
McFarland, and R. W. K. Odoni, and by M. Pohst.

The principles of noncommutative arithmetics are dealt with by Wilhelm 
Plesken and by Hans Zassenhaus. 

The geometry of numbers, an ally of number theoretical research since
Gauss and Dirichlet, which was developed as a new mathematical science 
by Hermite and Minkowski, is represented by the article of R. P. Bambah 
and A. C. Woods. 

The algebraic geometry of curves providing a rich field of investigation 
in the spirit of number theory is dealt with in the contribution of Manohar 
L. Madan and Sat Pal. An application in the same spirit on the theory of 
formal power series is given by Charles F. Osgood. 
Henry B. Mann


In 1975 Henry B. Mann celebrated his seventieth birthday. A mathematician
of international fame, Mann, in a career of more than forty years, has
made significant contributions to algebra, number theory, statistics, and 
combinatorics. 

Henry Mann was born October 27, 1905, in Vienna. He received his 
Ph.D. degree in mathematics in 1935 from the University of Vienna where, 
as a student of Philipp Furtwängler, he wrote his dissertation in algebraic
number theory. After a year of teaching school in Vienna and a couple of 
years spent in research and tutoring, he emigrated in 1938 to the United 
States. 

In New York he earned his living for several years primarily by tutoring. 
He had by then developed an interest in mathematical statistics, particularly 
in the analysis of variance, and in the problem of designing experiments with 
a view to their statistical analysis. He later contributed to this subject in a 
number of research papers and in his book (1949) “Analysis and Design of 
Experiments.” 

One of Mann’s most remarkable achievements was his discovery in 1941
of a proof of a celebrated conjecture of Schnirelmann and Landau in additive
number theory. This conjecture had its origin in the work of L. Schnirelmann
in the early 1930s. Schnirelmann had considered a density $\delta(A)$ for a
set of positive integers $A$, which he defined by

$$\delta(A) = \inf \{ A(n)/n \mid n = 1, 2, \ldots \},$$

where $A(n)$ is the number of positive integers $\le n$ in the set $A$. He showed
(most easily) that if the sum $A + B$ of two sets of positive integers is formed
by $A + B = \{a, b, a + b \mid a \in A, b \in B\}$, then the density satisfies the rules:

(1) $\delta(A + B) \ge \delta(A) + \delta(B) - \delta(A)\delta(B)$.

(2) $A + B$ contains all positive integers if $\delta(A) + \delta(B) \ge 1$.

From these two rules he obtained (readily) the result that any set having
positive density is a basis for the integers (that is, if $\delta(A) > 0$, then the sum of
$A$ with itself sufficiently many times contains all positive integers). As an
application of these ideas, Schnirelmann proved (for the first time) the existence
of a value $k$ such that every integer greater than 1 is the sum of at most $k$
primes. This he did by showing that $P + P$, where $P$ is the set of primes together
with 1, has positive density, hence is a basis for the integers.
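The density and rule (1) above can be explored numerically. The following is only a minimal sketch: it truncates every set at a finite bound N, so the value it computes is an upper bound for the true density (which is an infimum over all n), and the names `truncated_density` and `sumset`, the bound N = 200, and the two sample sets are illustrative assumptions, not taken from the text.

```python
from fractions import Fraction

def truncated_density(A, N):
    """Return min of A(n)/n over 1 <= n <= N, with A(n) = #{a in A : a <= n}."""
    members = set(A)
    count = 0
    best = Fraction(1)
    for n in range(1, N + 1):
        if n in members:
            count += 1
        best = min(best, Fraction(count, n))
    return best

def sumset(A, B, N):
    """Return {a, b, a + b : a in A, b in B}, restricted to 1..N."""
    A = {a for a in A if 1 <= a <= N}
    B = {b for b in B if 1 <= b <= N}
    return A | B | {a + b for a in A for b in B if a + b <= N}

N = 200                      # hypothetical truncation bound
A = set(range(1, N + 1, 3))  # 1, 4, 7, ... (sample set)
B = set(range(1, N + 1, 5))  # 1, 6, 11, ... (sample set)
C = sumset(A, B, N)

dA, dB, dC = (truncated_density(S, N) for S in (A, B, C))
print(dA, dB, dC)
print(dC >= dA + dB - dA * dB)  # rule (1), checked on the truncation only
```

Exact rational arithmetic (`Fraction`) is used so that the comparison in rule (1) is not disturbed by floating-point rounding.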

Out of further study of these ideas by Schnirelmann himself and by E. 
Landau, there arose the conjecture that (1) and (2) may be replaced by the 
much stronger statement: Either A + B contains all positive integers or 

$$\delta(A + B) \ge \delta(A) + \delta(B).$$


This conjecture, appealing in its apparent simplicity, soon attracted wide 
attention. Many distinguished mathematicians attempted to find a proof; 
indeed, partial results were obtained over the next decade by E. Landau, A. 
Khintchine, A. Besicovitch, I. Schur, and A. Brauer. 

It was this conjecture that Mann succeeded in proving in 1941. His 
interest in the problem had been aroused through the lectures of A. Brauer 
at New York University. Actually, he proved the still sharper statement: If
$C = A + B$, then either $C(n) = n$ or

$$\frac{C(n)}{n} \ge \min_{\substack{1 \le m \le n \\ m \notin C}} \frac{A(m) + B(m)}{m}.$$
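Mann's sharper statement can be checked by brute force on small examples. The sketch below is an illustration under stated assumptions: the helper names `counts` and `mann_inequality_holds`, the bound 100, and the sample sets (squares and triangular numbers) are hypothetical choices, and a finite check of this kind illustrates the inequality without proving anything.

```python
from fractions import Fraction

def counts(S, N):
    """Return a list c with c[n] = number of elements of S in 1..n."""
    c = [0] * (N + 1)
    for n in range(1, N + 1):
        c[n] = c[n - 1] + (1 if n in S else 0)
    return c

def mann_inequality_holds(A, B, N):
    """Check, for each n in 1..N, that C(n) = n or
    C(n)/n >= min over m <= n, m not in C, of (A(m) + B(m))/m."""
    A = {a for a in A if 1 <= a <= N}
    B = {b for b in B if 1 <= b <= N}
    C = A | B | {a + b for a in A for b in B if a + b <= N}
    cA, cB, cC = counts(A, N), counts(B, N), counts(C, N)
    for n in range(1, N + 1):
        missing = [m for m in range(1, n + 1) if m not in C]
        if not missing:  # the case C(n) = n
            continue
        bound = min(Fraction(cA[m] + cB[m], m) for m in missing)
        if Fraction(cC[n], n) < bound:
            return False
    return True

squares = {k * k for k in range(1, 11)}                # sample set A
triangular = {k * (k + 1) // 2 for k in range(1, 14)}  # sample set B
print(mann_inequality_holds(squares, triangular, 100))
```

The quadratic-time search over the missing m is deliberate: it mirrors the statement term by term, which matters more here than speed.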


For his proof he was awarded the Cole Prize in Number Theory by the 
American Mathematical Society in 1946. The technique that Mann introduced
in his proof, and its various modifications, have led to further
important results in additive number theory and have also proved useful in 
the more general setting of additive problems in groups. 

In 1942 Mann was the recipient of a Carnegie Fellowship for the study of 
statistics at Columbia University. At Columbia he had the opportunity of 
working with Abraham Wald in the department of economics, which at that 
time was headed by Harold Hotelling. He taught for a year (1943-1944) in 
the Army Specialized Training Program at Bard College; he spent a year 
(1944-1945) as research associate at Ohio State University, and six months 
as research associate at Brown University. In 1946 he returned to Ohio State 
to join the mathematics faculty where, as associate professor (1946-1948) 
and full professor (1948-1964), he was actively engaged in teaching and 
research for many years. He held professorships at the University of Wisconsin
Mathematics Research Center from 1964 to 1971, and at the University
of Arizona from 1971 until his retirement in 1975. 

Mann’s research interests in algebra and combinatorics cover a wide 
range. He has a special fondness, though, for algebraic number theory and 
Galois theory, and has imparted his enthusiasm for these subjects to many 
students over the years. Besides his dozen or so papers that contribute 
directly to these subjects, several of his papers on difference sets and coding 
theory contain beautiful applications of theorems on algebraic numbers and 
Galois theory. 

Neither Henry Mann’s research nor, for that matter, his teaching has
ended with his retirement; he spent the past spring term as visiting professor 
at Oregon State University. He and his wife Anne, who this year observed 
their 41st wedding anniversary, plan to continue living in Tucson, Arizona. 

His friends, colleagues, and students wish him many more years of good 
health and enjoyment of mathematics. 

John Olson 
Arnold E. Ross


The following is a quotation from an address in honor of 
Arnold E. Ross 

at the Meeting of the Ohio Section of the Mathematical Association of America 
at Youngstown, Ohio, May 7, 1976


Tonight we are here to honor Arnold E. Ross, the man, the mathematician,
and the leader of men.

We are very glad to have Arnold and Bee with us. She has been his 
companion in everyday perplexities and his pillar of strength.

Born in Chicago in 1906 to an immigrant couple from southern Russia, 
Arnold was taken by his mother back to the old country in his early youth 
for a visit with his grandparents. The first World War and then the revolution
cut them off from the USA for many years. He received most of his
precollege education in Odessa, a cultured city on the Black Sea which 
provided the early environment of many famous scientists, musicians, and 
politicians. 

Early influences are most formative and of the greatest impact on the 
intellectual development of a person and on the direction of his life work. 

Arnold must have been a precocious youngster since he was able to enter 
the University under a special arrangement at less than sixteen years of age. 
It was Professor Shatunovsky who was deeply interested in very gifted 
youngsters and who was a friend of Arnold Ross’ physician uncle who 
encouraged this early start and helped with the needed arrangements. Besides
Arnold, a cousin of his as well as one of his friends and another
future mathematician, Felix Gantmacher, took part in the program, all of 
about the same boyish age. S. O. Shatunovsky (1859-1929), a distinguished 
Russian mathematician and educator, was one of the leaders of science and 
mathematics education in postrevolutionary Russia. Another prominent 
leader of science education in the 1930s in Russia was the energetic 
physicist Peter Kapitza (b. 1894).

A small number of farsighted distinguished Russian scientists and mathematicians
building on a centuries old excellent academic tradition
succeeded in the twenties and thirties in designing and implementing a 
broadly based, competitive system of science and mathematics education in 
the elementary and secondary schools of the USSR, succeeding thereby in 
creating many more opportunities for the talented children of all sectors of 
the people than there had been available in czarist Russia. 

In this country, Arnold’s summer program for gifted high school students
and a few other similarly oriented programs amply demonstrated in
the sixties and seventies that a sizeable number of boys and girls at an age of 
13 or slightly older are at the peak of their learning power, full of curiosity 
and eagerness to explore new avenues of thought, ready to be led by skillful
teachers much further than is presently possible at the high schools of this 
country. 

Hungary and Belgium also have long traditions of talent search and 
nurture from which Arnold and his helpers have profited. 

I can testify about a similar experience in Germany, where my parents 
sent me to a new school founded in Hamburg, Germany, a few years after 
the first World War, by a group of teachers who believed in “unlimited 
horizons” of learning for boys and girls between 10 and 18. Though the 
science and math training at the Lichtwarck Schule was no different from 
the traditional fare of other Hamburgian schools, in all other subjects
(languages, music, art and handcraft, history and government, art appreciation,
and philosophy) we received far more stimulation and we were given
a good many more opportunities of independent studies than were available 
to other Hamburgian high school students. 

The blessings of early nurture of intellectual curiosity by the careful 
attention of excellent teachers make the recipients desirous of conferring 
similar blessings on the next generation. 

When Arnold came to Chicago in 1922 he was ready to study; but he first 
went to an engineering school and only in 1925 did he enroll in the study of
higher mathematics at the University of Chicago. 

I know many other colleagues whose love of mathematics was kindled 
while they were studying engineering. Raoul Bott started out as an engineering
student and so did Hans Schwerdtfeger. At the University of Chicago
Arnold fell under the spell of E. H. Moore (1862-1932), the well-known 
mathematician, educator, and organizer. It was of course E. H. Moore’s 
problem-oriented method of teaching by self-discovery that left its permanent
imprint on his mind as it is bound to influence everybody who has ever
come under its spell.† On the basis of the earlier experience at the University
of Odessa the new experience served as a catalyst for what became visible 
many years later. Shatunovsky and Moore both pioneered the directed limit 
generalization of the limit concept of analysis; both were very influential 
educational organizers and leaders in their great countries. 

Arnold E. Ross graduated Ph.D. under L. E. Dickson in 1931. Subsequently
he published a number of deep researches on the arithmetical
properties of ternary quadratic forms. The big unsolved question remains 
whether a real ternary indefinite quadratic form that is bounded away from 
zero must be proportionate to a rational form. 

Two postdoctoral years were spent at the California Institute of Technology,
1931-1933, where E. T. Bell and H. Bateman were active at that time.
The next two years saw Arnold teaching mathematics at the Chicago City 
College Cooperative College, an academic self-help effort of the depression 
years which must have presented its organizers with quite a variety of nontrivial
teaching problems.

Arnold spent his early academic career at the Department of Mathematics
of the University of St. Louis at St. Louis, Missouri, 1935-1946. In those
years there was ample time to discuss all the problems of a small, but lively 
department of mathematics as well as the problems of higher mathematics 
and science education in a new country with colleagues like S. Warshawsky 
and G. Szegő and visitors like E. Artin, all three of them recent immigrants
from Europe. 

Following the European tradition, Arnold always took a strong interest 
in the sciences, in particular in modern physics. It was because of his friendship
with C. Mullins and J. Guth that in 1946 they suggested to Father Kavanaugh,
the president of the University of Notre Dame, Indiana, to ask Arnold
E. Ross to succeed Karl Menger as the chairman of the Mathematics 
Department. One of the first things the new chairman did, as Arnold once 
told me, was to change the alphabetic listing of the freshman class lists to 
grouping by experience and readiness to study. He went in person to the 
President to achieve this change by force of his conviction. I think this little 
anecdote is characteristic of the length to which Professor Ross is likely to go 
for his students. 

† See F. Browder, “Youthful enjoyment of rigorous thinking,” University of Chicago Magazine,
Winter 1975; and A. E. Ross, A preliminary report on a talent search program at the
University of Chicago, November 1975.





One of his first Notre Dame students was Charles Misner, now a famous 
theoretical physicist and cosmologist who profited much from the ambitious 
undergraduate honors program developed jointly by A. E. Ross and C. 
Mullins. 

During his 28 years as Chairman of the Department of Mathematics of 
the Notre Dame University (1946-1963) and The Ohio State University 
(1963-1974) Arnold E. Ross built up from inconspicuous beginnings a cornucopia
of original training programs reflecting the essence of his own early
experiences at Odessa and Chicago: that talent is everywhere; it only needs to
be nurtured lovingly to come to fruition.

Let me describe these programs in their fully developed form. 

1. Summer program for gifted high school students (1964-1974 OSU, 
1975 University of Chicago; ongoing; some NSF support; idea of program 
spreading to India and Australia, generating strong interest in Germany). 

“Is community affluence a necessary prerequisite for early excellence? 
No, also small urban or rural communities, even the heart of the city communities
harbours early excellence, it only must be developed” [13].

From a national reservoir of applications between 40 and 65 gifted boys 
and girls from the ages of 14 to 17 are selected for intensive on campus 
training by the best available teachers over a period of eight weeks for one, 
two, or three summers. Supervised housing is provided in campus dormitories.
Former students of the program and other students act as counsellors
and helpers for the young visitors who are under their care and supervision 
during the period of the training program. 

I have often taught in these summer programs. The students I had there 
were my best ever. I could go as far with them (in essence) during a period of 
eight weeks as I could go with a normal graduate student class in the course of 
three quarters. I remember still vividly Mary, one of the gifted high school 
girls, sitting up on a tall tree in the middle of the Oval, deeply engaged in 
the study of a math book while everybody else basked in the warm summer 
sunshine. 

The talented student brings as much to the program as he carries away! 

Professor A. E. Ross discovered in a life long experience of discovery 
with his youngsters that a course in elementary number theory is particularly 
suitable to test and develop the talents of a young gifted person. Here follows a
quotation from [14] regarding the objectives of the number theory training: 
“It is our aim to develop attitudes as well as skills, to introduce the students 
to an intelligent use of algorithms as well as to the mastery of the underlying 
theory, and to make use of number theory not only as an important stepping 
stone to the study of both analysis and algebra but also as an environment in 
which we can exhibit a whole gamut of vital dilemmas which confront any 
scientist in any field at one time or another. 
“From the very beginning the student is given an opportunity to develop
his powers of observation, to experiment, and to discover significant relations
between the objects of his experimentation. The student learns to use
the device of counterexamples to destroy the untenable conjectures, and as 
his experience grows he learns the meaning of providing the security of a 
proof for the surviving conjectures. As is natural in all fields of human 
activity the word labels follow recognition of phenomena, and the incentive 
for the precise and concise use of language comes from the desire and the 
need to share one’s experience with others.” 

In the spirit of discovery so natural to all number theorists, and on the 
basis of the mathematical skills developed in the initial number theory 
course many other topics were taught successfully to those youngsters, e.g., 
mathematical logic (I. Thomas), geometry (J. Yaqub and H. Zassenhaus), 
linear algebra (Ledermann), algorithmic mathematics (H. Brown, H. Zassenhaus),
combinatorics (D. Ray-Chaudhuri, N. Robertson, T. Dowling, and
R. Wilson), analytic number theory (P. Turán), geometry of numbers (K.
Mahler, A. Woods, P. Bambah, F. H. Davenport), advanced number theory 
(A. E. Ross), discrete groups (H. Zassenhaus), discrete geometry (A. 
Heppes). As in all other projects he accomplished, Arnold Ross used the 
principle “to match the talent of the teacher with the talent of the students.” 

2. Academic Year Institute. A strong program of training master
teachers implemented at the University of Notre Dame 1960-1970 with NSF 
support. 

Distinguishing features: Classes of not more than 20 selected mathematics
teachers trained intensively during two summer periods of about six
weeks each and an academic year between. 

The choice of curriculum is guided by F. Klein’s ideas on “elementary 
mathematics from the higher view point.” Training offered in number theory 
(introductory), hard analysis (real and complex), linear algebra (geometric 
algebra), and geometry (affine and projective). 

In my 1963 article† I reported about my experience of teaching geometric
algebra while Y. Rainich was teaching the geometry. It is the renewal of 
commitment to basic mathematical ideas which distinguishes this program 
from an unstructured university program as well as from a program of 
studies in a college of education without the input of mathematicians. It is 
the three-phase nature of this particular program which gives it its depth. Of 
course the majority of the student participants continued their teaching 
career, as was the intention of the program. But many of them were placed 
into positions of increased administrative responsibility. About a third of the 
trainees went on to graduate Ph.D. at a number of universities and are now 

† H. Zassenhaus, “The concept of depth in teacher training,” Amer. Math. Monthly 70
(1963), 85-88.
in academic life. One graduate (Burt Kaufman), after an unusual academic
and teaching career, founded a Federal laboratory for mathematical curriculum
research and development which is now operating at St. Louis,
Missouri. 

3. Proposal to upgrade the performance of graduate student assistants
(see A. E. Ross, 1965). This became a living way of introducing the untutored
incoming graduate student of mathematics at the Ohio State
University into their subsequent course of more rigorous studies during the 
summers 1970-1974 by a specially designed summer program. It was always 
interesting to watch how the enthusiasm of the gifted high school students 
slowly infected the incoming graduate students, sometimes even the few 
older graduate students who were able to keep the pace of an ambitious 
eight week program. 

4. Basic algebra graduate course. A new course originally taught by A.
E. Ross (1967/68, 1968/69) to inexperienced incoming graduate students 
during the academic session. Based on lots of number theoretic examples the 
basic concepts of contemporary algebra are developed. The class is 
regrouped several times, and the top section is transformed in the course of 
the third quarter or in the subsequent summer quarter to a group of budding 
professional mathematicians. 

For the three programs: 

5. Horizons Unlimited,

6. New Careers, and

7. Saturday Morning Program, see A. E. Ross, 1969, 1970. The Saturday
morning classes started out as a Sunday class at Notre Dame University
for local children who liked to be exposed to challenges of their
mathematical inventiveness. For practical reasons the time of the weekly 
gatherings was changed to Saturday mornings. I have participated in this 
enterprise for several years; I was amazed at the distances boys and girls 
would travel (as far as from Gary to Notre Dame, or farther) to pick up 
additional information on regular polyhedra or rigid movements of the
plane. Once, we spent nearly an hour on just discussing and illustrating the 
nature of the identity mapping interpreted as a rigid “motion.” 

Saturday morning classes are now being conducted by A. E. Ross and his 
staff in cooperation with the Columbus school system. The value of this kind 
of university input into the local community cannot be measured in terms of 
dollars and cents. Our environment becomes a better place to live in if it 
provides opportunities like the Saturday morning classes to all inquisitive 
persons. 

In a long career leading from his 1931 graduation at the University of 
Chicago to the 1976 Summer Institute at the same University Professor 
Arnold E. Ross developed and exercised his own talents as a teacher and
“conductor” of the mathematical community. 

His personal influence and the excellence of his building is much in 
evidence at the University of Notre Dame and at The Ohio State University 
and will remain so after his retirement. Through his Summer Institute, many 
gifted students of the State of Ohio as well as of every other State of the union 
and also of the Canadian provinces of Ontario and Quebec have profited 
immensely. What is more important, the ideas of Arnold E. Ross continue to 
develop their infectious quality in far away countries like Australia, India, 
and Germany, and his example and personal stimulation remain alive with 
colleagues like A. Woods, J. Yaqub, D. Ray-Chaudhuri, J. Riner, T. Ralley, 
and many others, including myself. 

Hans Zassenhaus 
At the age of about 15 (a pupil of the only Mittelschule for girls in Linz,
Upper Austria) I came to realize that science and mathematics were to be 
my subjects. Slowly this changed to “mainly mathematics,” with science still 
of great interest to this day, and in due course mathematics meant mainly 
number theory. 

I became a student of the University of Vienna where I received my 
doctoral degree, with a dissertation entitled “Über eine Verschärfung des
Hauptidealsatzes.” My supervisor was P. Furtwängler who had established
the main facts about the Hilbert class field. Hilbert had himself stated these 
facts, based only on his research of relative quadratic extension fields. Only 
very few of Hilbert’s statements were proved to be incorrect, among them 
was the statement that the class field had class number equal to 1. However,
the weaker statement, the Hauptidealsatz, was proved correct during
my student days by Furtwängler too, using the newly found technique of
translating the assertion into a statement about finite groups with abelian 
commutator subgroups. With feasible problems in class field theory being 
very scarce Furtwängler was delighted to have plenty of thesis problems
available now by using the same technique. His idea was to study the 
fields between the given field and the class field to see which and how 
many ideal classes of the ground field would become principal in such a field, 
i.e., a generalization of Hilbert’s Theorem 94. Unfortunately, this did not 
turn out to be a good idea. The technique did not work well for the intermediate
fields and the theorems seemed to be of a chaotic nature. The results
of my thesis problem made Furtwängler give up class field theory forever
after. Artin called the problems hopeless. However, I myself returned to 
them in quite recent years, based on a group theoretic proof of Theorem 94 
by M. Hall. I gave this new research the title “Hilbert’s Theorem 94,” and 
several, mainly young people, have contributed toward it. At the time of my 
recent Ph.D., however, I made only two further contributions which had a 
certain influence many years later. One was my collaboration, mainly by 
correspondence, with A. Scholz on our investigation of Theorem 94 for 
relative cubic unramified extension fields of a number of imaginary quadratic
fields. In this case exactly one class becomes principal (a fact that Scholz
denoted as “capitulation”). Further, we examined the group theoretic possibilities
for the group of the second class field for fields with 3-class group
with two basis elements, assuming that only one ideal class becomes principal.
The second contribution concerned a group theoretic proof for the fact
that for the 2-class group a group of type (2, 2), the class field has an odd 
class number. This fact, the result of the joint work with Scholz, and the fact 
that a cyclic p-class group always leads to a class field with class number
prime to p, led to a conjecture concerning group towers whose positive
answer would have implied that every class field tower is finite. However, the 
group tower conjecture was defeated on group theoretic grounds and finally 
Šafarevič (together with Golod) showed the existence of an infinite group
tower for a numerical case. 

Among other subjects of particular interest during my student days was 
the work of Menger on metric geometry and mathematical logic. I obtained 
some results on the former subject and participated in the Wiener Kreis led 
by Schlick. 

During the following years I obtained a position at the Mathematics 
Institute of Göttingen University as one of the editors of Hilbert’s work in
number theory and then returned to Wien as assistant to Hahn and Menger. 
I supervised a thesis on multiply monotone sequences, learning functional 
analysis in this way. At that time two important changes happened to me. 
On the mathematical side I became interested in topological algebra and 
“sums of squares,” on the career side I obtained a scholarship at Bryn Mawr 
College and a fellowship at Girton College, Cambridge, England. Topological
algebra came to me via Pontrjagin’s work on topological fields and some
fascinating problems posed by van der Waerden in the Jahresber. Deutsch.
Math.-Verein. In a joint paper Jacobson and I generalized Pontrjagin’s
work to locally compact rings, and I published the first topological proof 
(via spheres) of Frobenius’ theorem concerning hypercomplex systems over 
the reals which are also division algebras. Later I proved that the Laplace 
differential equations in n dimensions for n functions can be deduced 
from generalized Cauchy-Riemann equations only for n = 1, 2, 4, 8. This
was reproved by Stiefel and plays a certain role in combinatorics now. 

Jobs were very scarce in Great Britain then, even for British-born people 
and even more for the many foreigners taking refuge there. I obtained a 
junior teaching position in the University of London. It was then that I 
met John Todd (Jack) employed in a similar position at King’s College, 
University of London. We got married in 1938. 

In due course the war broke out and life became very difficult, e.g., we 
had to move 18 times during the war. The first year of the war was spent at 
Belfast where I supervised a young theologian, E. Best, who was also very 
much interested in mathematics. Jointly we started on the investigation of 
finite groups for which the concept of “normal subgroup” is transitive. This 
came about through a suggestion of John Todd. This work was continued in 
Italy (Zacher, Zappa) and in Germany (Gaschütz, Huppert). During my
time in Belfast I was cut off from my own mathematical papers, but the 
library of Queen’s University there was very helpful to my needs. I saw there 
among other things the papers of McCoy on matrices with property P and 
the papers of Latimer and MacDuffee on classes of matrices. Both these 
rather different basic sets of ideas have stayed with me ever since and have 
led to substantial research on my part. I will come back to this soon. 

My London college moved to Oxford during the war, a city expected to 
be safe from air raids. However, life there was very complicated because of 
the shortage of housing for just the reason of safety. In some ways I preferred 
living in London in spite of the bomb danger. My teaching duties became 
heavier, but not more interesting. My husband had a scientific research job 
with the British Admiralty in Portsmouth and later in London. I myself 
obtained a research job with the Ministry of Aircraft Production in 1943. It 
was there that I became fully interested in matrix theory for I worked in the 
so-called Flutter Group, under the matrix expert R. A. Frazer. It was impossible 
not to come under the spell of matrix theory in that group. My interest 
in bounds for characteristic roots of matrices, in particular in Geršgorin’s 
theorem and in criteria for stable matrices, arose out of this phase of my life. 
To this I will return. However, I myself was asked to work on boundary 
value problems for hyperbolic differential equations arising from flutter at 
supersonic speed on which Temple had already made progress. There were 
new problems, and my particular problem was mentioned in Hadamard’s 
book, “Theorie des Ondes,” in a footnote, as not arising in practical work. 
After a long struggle I managed to solve it and I published it after the war in 
a book dedicated to Courant on his 60th birthday. 

In 1947 my husband accepted an invitation to the USA to the National 
Bureau of Standards to work toward the exploitation of high speed electronic 
computing machines which were under construction then. I accompanied 
him and was soon employed myself there. Our chief, John H. Curtiss, 
had much understanding for my professional possibilities. 

During this year we were allowed to spend one term at the Institute for 
Advanced Study where we were appointed as members. We were attached to 
the von Neumann group. It was there that I started to shake off my wartime 
weariness and also my wartime research interests. The latter, however, were 
still helpful in my work at the National Bureau of Standards where we were 
called back a year later and where we stayed altogether for 10 years. 
Although I was given much freedom there, on the whole, I was supposed to 
devote myself to problems that could be applied to practical work. I accepted 
part-time payment, partly to be able to return to number theory, 
partly to avoid the fixed civil service hours. During my time at Princeton I 
had contacts with Artin, Chowla, Reiner, Schafer, among others. In particular 
a conversation with Chowla brought me back to the theorems of Latimer 
and MacDuffee on classes of matrices. At the end of my stay I gave a lecture 
at Harvard on my boundary value problem and then proceeded to Los 
Angeles, to work with Jack at the newly opened Institute for Numerical 
Analysis (INA), a field station of the National Bureau of Standards at 
UCLA. After a very cold winter in Princeton the California climate helped 
to restore me even more and my new research interests became even clearer 
to me: eigenvalues of polynomials in several matrices on the one hand, 
integral matrices on the other. For the first a suggestion by Mark Kac was 
most helpful. It was the definition of the L-property (L for linear) for pairs of 
matrices. This property means that in the pencil generated by the pair, the 
eigenvalues are linear functions. When the very gifted mathematician 
T. S. Motzkin joined INA, I told him of my partial results. We soon 
started on joint publications and proved some general facts on matrices with 
multiple eigenvalues in a pencil and a theorem (suggested to us by Kaplansky) 
concerning a pencil in which every matrix is diagonable. Motzkin introduced 
methods of algebraic geometry into this subject. I myself realized 
its close connection to complex function theory; Kato reproved our result 
via perturbation methods and included it in his big book on this subject. 
Later on I published further work on the L-property by myself and suggested 
problems to others. 

I have at all times suggested problems to others, and I had even published 
several in the section on research problems in the Bulletin. Five of those 
led to published research. One of them concerned the infinite Hilbert matrix. 
This matrix does not have π as an eigenvalue if only vectors in the 
usual Hilbert space are admitted. However, I suggested proving that π was 
still an eigenvalue for vectors outside this space. This was shown by Kato 
and further elaborated by M. Rosenblum. 

While still at the Bureau of Standards, I continued research on the 
theorem of Latimer and MacDuffee and started work on units in the integral 
group ring. I became interested in this via Higman’s thesis. Later I treated 
them via unimodular group matrices. I introduced M. Newman into this 
subject, and we published several papers on it; I was particularly concerned 
with the positive definite case. 

In 1957 we both accepted positions at California Institute of Technology 
and it seemed to me as if an odyssey of 20 years (I left Cambridge, England 
in 1937) had ended. I could at last work again with academic freedom and 
have Ph.D. students. I had eleven there. At the National Bureau of 
Standards I had one only, K. Goldberg, attached to the close-by American 
University. He worked on the Hausdorff formula. Otherwise I worked with 
several postgraduates like A. J. Hoffman and with visitors: Ostrowski, 
Wielandt, K. Fan, H. Cohn, J. C. P. Miller, P. Stein, Stiefel. 

When still at the National Bureau of Standards I was able to use a 
theorem by Shoda which I had cherished for a long time. It concerned 
matrices with trace = 0 or det = 1. Now at Caltech my student R. C. 
Thompson generalized this theorem to all possible fields. Another student, 
C. Hobby, with the help of Zassenhaus, contributed greatly to the solution of 
the class field tower problem. His work was completed by Serre. 

Several scholars of immense power came to Caltech during my stay and 
worked with me a great deal. After years of frequent isolation from mathematicians 
working in my line, or sometimes from mathematicians 
altogether, sometimes in positions with hard duties, this was most beneficial 
for me. These were particularly in number theory Zassenhaus, Dade, and A. 
Froehlich. They were really interested in my problems on integral matrices 
and raised my understanding of my own ideas. Unfortunately, in previous 
times, even with brilliant mathematicians as colleagues I could not force 
myself to break away from my own problems, even if my progress was 
frustrating at times. Even in my student days I was alone in my number 
theoretic work. In Europe in the old days one did not call on one’s supervisor 
freely. However, this attitude helped me to develop my own problems. In 
recent years a young man Kisilevsky has been a brilliant colleague. In any 
case working with the gifted Caltech students is a great stimulus. 

Again, at Caltech, like at the Bureau of Standards, the chairman of the 
department, Frederic H. Bohnenblust, set up my job to give me much 
freedom. 

I want to return to my work on matrices at Caltech. I continued on the 
L-property, but still stimulated by my experience in aerodynamics I made 
progress on the algebraic treatment of Lyapunov’s stability criterion. This 
led to considerable work by others. Here I obtained much stimulus from 
W. Givens and from H. Schneider. My contacts with the powerful matrix 
school of Marvin Marcus are very beneficial at all times. Work with Wielandt 
who visited Caltech was also very beneficial. 

Apart from work on the L-property I was active in other parts of matrix 
theory, other generalizations of commutativity, and in particular commutators. 
The thesis of Gaines fits in here; that of Parker is in another area of 
matrix theory. 

At times work on matrices leads to analogous investigations in number 
theory and for real matrices. This was evident in my own work in two 
different areas: (1) Positive definite real symmetric or hermitian matrices. 
Here, e.g., my work connected with the Lyapunov theorem and two theses 
(C. Johnson, R. Loewy) fits in. On the other hand, in number theory I 
worked on positive definite group matrices (i.e., integral “symmetric,” “positive 
definite” group ring elements) and on integral symmetric matrices (this 
is linked to positive definite similarity). For the rational case there is the E. 
Bender thesis, recently greatly extended by him. (2) Factorization into symmetric 
factors (thesis of Uhlig) for real matrices and into integral symmetric 
matrices for integral matrices (so far mainly for n = 2) by myself. This last 
work is linked with norms from quadratic fields, as is my quite recent 
work on rational 2 × 2 commutators AB − BA. 

Among my matrix work there is a commutator result on which I made 
some observations in a paper of 1961 (with elegant follow-up by Zassenhaus) 
which seems to have stimulated analysts like De Prima and his students, 
Putnam, and Berberian, although its origin is from classical finite group 
theory. It concerns unitary matrices whose eigenvalues lie on less than a 
semicircle, called cramped matrices now. 

While I myself found no occasion to return to the Geršgorin theorem, 
my early light contributions and problems were taken up by several 
colleagues, in particular A. J. Hoffman, P. Henrici, J. Todd, and R. S. Varga, 
who arrived at important results. 

Of my other number theory students—L. Foster, D. Davis, and D. Maurer 
—the first two based their investigations on tables obtained by computers. 
I myself have been much interested in computational number theory, a 
subject in which my co-author A. Scholz—mentioned earlier—was a master. 
For my work on integral 2 × 2 matrices the tables of E. L. Ince on class 
groups and units in real quadratic fields are a wonderful help. 

Among other results obtained during my Caltech time is a “sum of 
squares” result, concerning a new 8-square identity. This was generalized by 
Eichhorn and Zassenhaus to a 16-square identity. 

We consider a lattice L in an octave algebra V, and an even integral quadratic form q, defined on L. 
It will be shown that $\sum_{a \in L,\ q(a) = n} (a^{2k}, e)$ is $O(n^{k+3})$ if k = 1, 2, 3, and is $O(n^{k+3/2+\varepsilon})$ for any $\varepsilon > 0$ 
if k ≥ 4. 


Introduction 

It is known that the number of variables of an even integral quadratic 
form with determinant unity must be divisible by 8. It is also known that 
there is, up to equivalence, exactly one such form of eight variables. We 
consider an octave algebra V over the field of real numbers, the underlying 
space of which is an eight-dimensional real vector space, equipped with a 
(multiplicative) norm q : V → R. A scalar product on V is defined by (x, y) = 
q(x + y) − q(x) − q(y). It will turn out that we can choose a lattice L ⊂ V 
such that q defines on L an even integral quadratic form with determinant 
unity. Let F be a spherical function with respect to q. By a theorem due to 
Schoeneberg (see [3]) we know that the theta series 

\[ \theta(\tau; F) = \sum_{a \in L} F(a)\, e^{2\pi i \tau q(a)} \]

(where τ is a complex variable whose imaginary part is positive) is a modular 
form of weight (4 + deg(F)) for the full modular group. If deg(F) > 0, it is 
even a cusp form. Define for any positive integer k 

\[ A_k(n) = \sum_{a \in L,\ q(a) = n} (a^k, e), \qquad n \ge 0. \]

If k is odd we have $A_k(n) = 0$. We will consider $A_{2k}(n)$. It will be shown that 
by choosing special spherical functions F we obtain a relation between 
$A_{2k}(n)$ and the coefficients of certain cusp forms. This will enable us to 
prove† 

Theorem For any $\varepsilon > 0$ 

\[ A_{2k}(n) = p_k\, n^k \sigma_3(n) + O(n^{k + 3/2 + \varepsilon}), \]

where $p_1 = -360$, $p_2 = 144$, $p_3 = -24$, and $p_k = 0$ for all k ≥ 4. 

In fact, this theorem states that for k ≥ 4 the geometrical distribution of the 
2kth powers of octaves in L of norm n is utterly dissimilar to this distribution 
for k ≤ 3. 

The contents are as follows. First we recall some facts about spherical 
functions, most of which may be found in Hecke [2, pp. 849-853]. Next we 
describe how $x^{2k}$ can be expressed in terms of x and (the unit element) e, and 
finally the proof will be carried out. 

1. Let V be a finite-dimensional real vector space of even dimension 2n, 
and let ( , ): V × V → R be a scalar product. We write $q(x) = \tfrac12 (x, x)$ for any 
x ∈ V. A homogeneous polynomial function F is said to be a spherical 
function with respect to q if 

\[ \frac{\partial^2 F}{\partial x_1^2} + \cdots + \frac{\partial^2 F}{\partial x_{2n}^2} = 0, \qquad \text{where } x = x_1 e_1 + \cdots + x_{2n} e_{2n} \]

for some orthonormal basis $e_1, \ldots, e_{2n}$ of V. Hecke [2, p. 853] has shown that 
we can obtain all spherical functions of degree k by taking linear combinations 
of the polynomials 

\[ q(x)^{k/2}\, H_k\!\left( \frac{(x, y)}{2\sqrt{q(x)\,q(y)}} \right) \]

for all parameters y ∈ V. Here $H_k$ is a polynomial in one variable T satisfying 
the differential equation 

\[ (1 - T^2) \frac{d^2 H_k}{dT^2} - (2n - 1)\, T\, \frac{dH_k}{dT} + k(2n + k - 2) H_k = 0. \tag{1} \]


† This question was raised by F. van der Blij. 

From 

\[ \int_{-1}^{1} H_k(T)\, H_l(T)\, (1 - T^2)^{n - 3/2}\, dT = 0 \quad \text{if } k \ne l \tag{2} \]

it is easy to deduce that $H_k(T)$ is up to a scalar multiple equal to 

\[ (1 - T^2)^{-n + 3/2}\, \frac{d^k}{dT^k} (1 - T^2)^{k + n - 3/2}. \]

We normalize $H_k$ such that the coefficient of $T^k$ is equal to 1. In particular we 
get 

\[ H_0(T) = 1, \qquad H_1(T) = T, \qquad H_2(T) = T^2 - \frac{1}{2n}, \qquad H_3(T) = T^3 - \frac{6n + 3}{(2n + 1)(2n + 2)}\, T. \]


We shall need only spherical functions of even degree 2k. We have, for k ≥ 1, 

\[ H_{2k}(T) = T^{2k} - \omega_{2k} + \sum_{j=1}^{2k-1} \gamma_{j,2k}\, H_j(T), \tag{4} \]

where $\gamma_{j,2k} = 0$ if j is odd. Using (2) we can determine the constant $\omega_{2k}$. We 
have 

\[ 0 = \int_{-1}^{1} H_{2k}(T)\, H_0(T)\, (1 - T^2)^{n - 3/2}\, dT = \int_{-1}^{1} (T^{2k} - \omega_{2k})(1 - T^2)^{n - 3/2}\, dT, \]

which implies 

\[ \omega_{2k} = \frac{\Gamma(k + \tfrac12)\, \Gamma(n)}{\Gamma(k + n)\, \Gamma(\tfrac12)}. \tag{5} \]
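
As a quick sanity check on (5), the Gamma quotient reduces to a finite product of rationals, so $\omega_{2k}$ can be evaluated exactly. The sketch below is only illustrative (the function names are not from the text). It assumes that $H_2$ and $H_3$ are the monic polynomials fixed by the orthogonality relation (2), so that $H_2(T) = T^2 - \omega_2$ and the coefficient of T in $H_3$ is $\omega_4/\omega_2$.

```python
from fractions import Fraction


def omega(k, n):
    """omega_{2k} = Gamma(k + 1/2) Gamma(n) / (Gamma(k + n) Gamma(1/2)), exactly.

    Gamma(k + 1/2) / Gamma(1/2) = prod_{i<k} (i + 1/2) and
    Gamma(k + n) / Gamma(n)     = prod_{i<k} (n + i).
    """
    value = Fraction(1)
    for i in range(k):
        value *= Fraction(2 * i + 1, 2 * (n + i))
    return value


def h3_coefficient(n):
    """Coefficient c in H_3(T) = T^3 - c*T, forced by orthogonality to H_1 = T."""
    return omega(2, n) / omega(1, n)


if __name__ == "__main__":
    for n in (1, 2, 4):
        print(n, omega(1, n), h3_coefficient(n))
```

For n = 1 the weight in (2) is the Chebyshev weight, and the values 1/2 and 3/4 agree with the monic Chebyshev polynomials $T^2 - \tfrac12$ and $T^3 - \tfrac34 T$.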


2. Let $V = \mathbb{R}^8$, considered as an algebra of octaves. As a general reference 
for octaves we use Coxeter [1]. Let $e_i$, 1 ≤ i ≤ 8, be the standard basis 
for V, and let the multiplication in V be fixed such that $e_1 = e$ is the unit 
element of V. We define for $x = (x_1, \ldots, x_8) \in V$, $q(x) = x_1^2 + \cdots + x_8^2$. 

The following formula holds: 

\[ x^2 = (x, e)\, x - q(x)\, e. \tag{6} \]

Furthermore q is multiplicative, i.e., 

\[ q(xy) = q(x)\, q(y). \tag{7} \]

From (6) we can easily deduce that $x^{2k} = P_{2k}\, x + Q_{2k}\, e$, where $P_{2k}$ and $Q_{2k}$ 
are polynomials in the two variables X = (x, e) and Y = q(x). We have 

\[ P_2(X, Y) = X, \qquad Q_2(X, Y) = -Y. \]

The following recursion formulas hold: 

\[ P_{2k} = X^2 P_{2k-2} - Y P_{2k-2} + X Q_{2k-2}, \qquad Q_{2k} = -XY P_{2k-2} - Y Q_{2k-2}. \tag{8} \]

We conclude from (8) that $P_{2k}$ and $Q_{2k}$ must be of the form 

\[ P_{2k}(X, Y) = \sum_{l=0}^{k-1} \alpha_{2l,2k}\, X^{2k-2l-1} Y^l, \qquad Q_{2k}(X, Y) = Y \sum_{l=0}^{k-1} \beta_{2l,2k}\, X^{2k-2l-2} Y^l. \tag{9} \]

By induction it can easily be proved that 

\[ \alpha_{2l,2k} = (-1)^l \binom{2k - l - 1}{l} \quad \text{and} \quad \beta_{2l,2k} = (-1)^{l+1} \binom{2k - l - 2}{l}, \qquad 0 \le l \le k - 1. \tag{10} \]

For convenience we put $\alpha_{2l,2k} = 0 = \beta_{2l,2k}$ if l < 0 or l ≥ k. We get 

\[ (x^{2k}, e) = \sum_{l=0}^{k} \bigl[ \alpha_{2l,2k} + 2\beta_{2l-2,2k} \bigr] (x, e)^{2k-2l}\, q(x)^l. \tag{11} \]
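
The induction behind (10) can be spot-checked mechanically. The following sketch is an illustration, not part of the original argument: it stores polynomials in X and Y as dictionaries, iterates the recursion (8) from $P_2 = X$, $Q_2 = -Y$, and compares the result with the closed forms (9) and (10), read here as $\alpha_{2l,2k} = (-1)^l \binom{2k-l-1}{l}$ and $\beta_{2l,2k} = (-1)^{l+1} \binom{2k-l-2}{l}$. All helper names are hypothetical.

```python
from math import comb


def add(*polys):
    """Sum polynomials stored as {(deg_X, deg_Y): coeff}, dropping zero terms."""
    total = {}
    for poly in polys:
        for key, coeff in poly.items():
            total[key] = total.get(key, 0) + coeff
    return {key: c for key, c in total.items() if c != 0}


def times(poly, coeff, dx, dy):
    """Multiply a polynomial by the monomial coeff * X^dx * Y^dy."""
    return {(i + dx, j + dy): coeff * c for (i, j), c in poly.items()}


def by_recursion(k):
    """P_2k and Q_2k from recursion (8), starting from P_2 = X, Q_2 = -Y."""
    P, Q = {(1, 0): 1}, {(0, 1): -1}
    for _ in range(k - 1):
        # The right-hand side is evaluated with the old P and Q before assignment.
        P, Q = (
            add(times(P, 1, 2, 0), times(P, -1, 0, 1), times(Q, 1, 1, 0)),
            add(times(P, -1, 1, 1), times(Q, -1, 0, 1)),
        )
    return P, Q


def by_closed_form(k):
    """P_2k and Q_2k from (9) with the coefficients (10)."""
    P = {(2 * k - 2 * l - 1, l): (-1) ** l * comb(2 * k - l - 1, l) for l in range(k)}
    Q = {
        (2 * k - 2 * l - 2, l + 1): (-1) ** (l + 1) * comb(2 * k - l - 2, l)
        for l in range(k)
    }
    return P, Q


if __name__ == "__main__":
    for k in range(1, 9):
        print(k, by_recursion(k) == by_closed_form(k))
```

For k = 2 both routes give $P_4 = X^3 - 2XY$ and $Q_4 = -X^2 Y + Y^2$.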

3. Let R ⊂ V be the subset 

\[ R = \{ \pm e_i \pm e_j \mid 1 \le i < j \le 8 \} \cup \Bigl\{ \tfrac12 \sum_{i=1}^{8} (-1)^{\nu(i)} e_i \Bigm| \sum_{i=1}^{8} \nu(i) \text{ even} \Bigr\}. \]

R is a root system of type $E_8$. Let L be the lattice generated in V by $\tfrac12 \sqrt2\, R$. A 
straightforward calculation shows that q(L) ⊂ Z, and q defines on L a quadratic 
form with determinant unity. It may be found in Hecke [2, p. 868] that 
for any positive integer n 

\[ \#\{ a \in L \mid q(a) = n \} = 240\, \sigma_3(n), \tag{12} \]

where $\sigma_3(n) = \sum_{d \mid n} d^3$. 
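
The case n = 1 of (12) can be checked by direct enumeration. The sketch below is illustrative only. It assumes R consists of the vectors $\pm e_i \pm e_j$ (i < j) together with the half-sums $\tfrac12 \sum \pm e_i$ having an even number of minus signs, lists them with exact rational coordinates, and confirms that there are 240 of them, each of squared Euclidean length 2. After scaling by $\tfrac12 \sqrt2$ each has q = 1, matching $240\, \sigma_3(1) = 240$.

```python
from fractions import Fraction
from itertools import combinations, product


def root_system():
    """Vectors +-e_i +- e_j (i < j) and (1/2) sum (+-e_i) with evenly many minus signs."""
    roots = set()
    for i, j in combinations(range(8), 2):
        for si, sj in product((1, -1), repeat=2):
            v = [Fraction(0)] * 8
            v[i], v[j] = Fraction(si), Fraction(sj)
            roots.add(tuple(v))
    for nu in product((0, 1), repeat=8):
        if sum(nu) % 2 == 0:
            roots.add(tuple(Fraction((-1) ** s, 2) for s in nu))
    return roots


def sigma3(n):
    """Sum of the cubes of the positive divisors of n."""
    return sum(d ** 3 for d in range(1, n + 1) if n % d == 0)


if __name__ == "__main__":
    R = root_system()
    print(len(R), 240 * sigma3(1))
```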

Now we take $F_{2k}(x) = q(x)^k H_{2k}\bigl( (x, e) / 2\sqrt{q(x)} \bigr)$, a spherical function of 
degree 2k. We know that $\theta(\tau; F_{2k})$ is a cusp form of weight 4 + 2k. We define 

\[ B_{2k}(n) = \sum_{a \in L,\ q(a) = n} F_{2k}(a) = n^k \sum_{a \in L,\ q(a) = n} H_{2k}\!\left( \frac{(a, e)}{2\sqrt{n}} \right). \]

Since there are no nonzero cusp forms of weight 6, 8, 10, we have 

\[ B_2(n) = B_4(n) = B_6(n) = 0. \tag{13} \]

By (4) and (12) we get 

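\[ B_{2k}(n) = 2^{-2k} \sum_{a \in L,\ q(a) = n} (a, e)^{2k} + \sum_{l=1}^{k-1} \gamma_{2l,2k}\, n^{k-l} B_{2l}(n) - 240\, n^k \omega_{2k}\, \sigma_3(n), \]

which implies 

\[ \sum_{a \in L,\ q(a) = n} (a, e)^{2k} = \sum_{l=1}^{k} \delta_{2l,2k}\, n^{k-l} B_{2l}(n) + 240\, n^k 2^{2k} \omega_{2k}\, \sigma_3(n) \]

for some constants $\delta_{2l,2k}$, 1 ≤ l ≤ k. Substituting this in (11) yields 

\[ A_{2k}(n) = \sum_{l=0}^{k-1} \sum_{j=1}^{k-l} \bigl[ \alpha_{2l,2k} + 2\beta_{2l-2,2k} \bigr] \delta_{2j,2k-2l}\, n^{k-j} B_{2j}(n) + 240\, n^k \sigma_3(n) \sum_{l=0}^{k} \bigl[ \alpha_{2l,2k} + 2\beta_{2l-2,2k} \bigr] 2^{2k-2l} \omega_{2k-2l}. \]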


Suppose now k ≥ 4 and consider the last sum. By (5) and (10) this sum 
equals 

\[ \sum_{l=0}^{k} (-1)^l\, \frac{12k\, (2k - l - 1)!}{l!\, (k - l + 3)!\, (k - l)!} = \frac{12\, (k - 4)!}{(k - 1)!} \sum_{l=0}^{k} (-1)^l \binom{k}{l} \binom{2k - l - 1}{k - 4} = 0, \]

as can easily be proved by writing $T^{k-1}(1 + T)^k = \sum_{l=0}^{k} \binom{k}{l} T^{2k-l-1}$, differentiating 
both sides k − 4 times, and substituting T = −1. So $A_{2k}(n)$ is a 
combination of $n^{k-j} B_{2j}(n)$, provided k ≥ 4, and since the $B_{2j}(n)$ are 
coefficients of cusp forms of weight 2j + 4, this establishes the part of the 
theorem for k ≥ 4. 

As for k = 1, 2, 3, it follows immediately from (13) that 

\[ A_2(n) = -360\, n\, \sigma_3(n), \qquad A_4(n) = 144\, n^2 \sigma_3(n), \qquad A_6(n) = -24\, n^3 \sigma_3(n), \]

which establishes the remainder of the theorem. 
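
The constants in the theorem can be recomputed from the pieces above. The sketch below is an illustration under stated assumptions, not the author's computation. It takes the main-term constant to be $240 \sum_{l=0}^{k} [\alpha_{2l,2k} + 2\beta_{2l-2,2k}]\, 2^{2k-2l} \omega_{2k-2l}$, with the coefficients read from (10) and with $2^{2m} \omega_{2m} = 6\,(2m)! / (m!\,(m+3)!)$, which is (5) specialised to n = 4 (dimension 2n = 8). The function names are hypothetical.

```python
from fractions import Fraction
from math import comb, factorial


def alpha(l, k):
    """alpha_{2l,2k} from (10); zero outside 0 <= l <= k - 1."""
    return (-1) ** l * comb(2 * k - l - 1, l) if 0 <= l < k else 0


def beta(l, k):
    """beta_{2l,2k} from (10); zero outside 0 <= l <= k - 1."""
    return (-1) ** (l + 1) * comb(2 * k - l - 2, l) if 0 <= l < k else 0


def scaled_omega(m):
    """2^(2m) * omega_{2m} for n = 4, i.e. 6 (2m)! / (m! (m + 3)!)."""
    return Fraction(6 * factorial(2 * m), factorial(m) * factorial(m + 3))


def main_term_constant(k):
    """240 * sum_l [alpha_{2l,2k} + 2 beta_{2l-2,2k}] 2^(2k-2l) omega_{2k-2l}."""
    return 240 * sum(
        (alpha(l, k) + 2 * beta(l - 1, k)) * scaled_omega(k - l)
        for l in range(k + 1)
    )


if __name__ == "__main__":
    for k in range(1, 7):
        print(k, main_term_constant(k))
```

Under these assumptions the values for k = 1, 2, 3 are −360, 144, −24, and the constant vanishes for k = 4, 5, 6, in line with the statement of the theorem.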
For i = 1, 2, 3 let $L_i = a_{i1} u_1 + a_{i2} u_2 + a_{i3} u_3$ be linear forms in the variables $u_1, u_2, u_3$ with real 
coefficients $a_{ij}$ satisfying $\det(a_{ij}) = 1$. It is proved that given any real numbers $c_1, c_2, c_3$ there exist 
integral values of the variables $u_1, u_2, u_3$ such that simultaneously 

\[ \prod_{i=1}^{3} |L_i + c_i| \le \tfrac14 \quad \text{and} \quad L_1 + c_1 < 0. \]


1. Introduction 


For 1 ≤ i ≤ n, let $L_i = a_{i1} u_1 + \cdots + a_{in} u_n$ be n linear forms in the variables 
$u_1, \ldots, u_n$ with real coefficients $a_{ij}$ satisfying $\det(a_{ij}) = 1$. A classical 
conjecture of Minkowski asserts that, given any real numbers $c_1, \ldots, c_n$, the 
inequality 

\[ \prod_{i=1}^{n} |L_i + c_i| \le 2^{-n} \]

has a solution in integral values of the variables. This has been proved only 
for n ≤ 5; see Skubenko [4]. Stemming from this problem is the conjecture 
that, given any real numbers $c_1, \ldots, c_n$, the inequalities 

\[ \prod_{i=1}^{n} |L_i + c_i| \le 2^{-n+1}, \qquad L_1 + c_1 < 0 \]

have a common solution in integral values of the variables. This has been 
proved by Cole [2] in the case n = 2. Our aim is to present a proof in the case 
n = 3, using a method introduced by Birch and Swinnerton-Dyer [1]. 

† The research of the second author was partially supported by National Science Foundation 
Grant GP43919. 

Let $A_j = (a_{1j}, \ldots, a_{nj})$ for 1 ≤ j ≤ n and denote by Λ the lattice of basis 
$A_1, \ldots, A_n$ so that d(Λ) = 1. Set $m(\Lambda) = \inf |x_1 \cdots x_n|$ extended over all 
points $(x_1, \ldots, x_n)$ of Λ other than the origin O. Paralleling the work of Birch 
and Swinnerton-Dyer [1], it is sufficient to consider only those lattices whose 
homogeneous minimum m(Λ) is positive and attained by some point of Λ 
provided it is shown that the strict inequality holds in this case. An alternative 
proof of this reduction will be given elsewhere using the divided cell 
technique. In what follows it is assumed that Λ has these properties. 

2. The Main Lemmas 

Let a be a positive real number with $a^3 \le \tfrac47$. Denote by $K_1$ the set of 
points (x, y) that satisfy an inequality of the form 

\[ |x|\, y_0 + |y|\, x_0 < 4/a \]

where $x_0$, $y_0$ are positive numbers with 

\[ x_0 + y_0 \le \sqrt{\tfrac{14}{3}}\, a + 4\sqrt{\tfrac{3}{14}}\, a^{-2} \quad \text{and} \quad x_0 y_0 = \frac{4}{a}. \]

Similarly, denote by $K_2$ the set of points (x, y) that satisfy 

\[ |x|\, y_0 + |y|\, x_0 < 4/a \]

where $x_0$, $y_0$ are positive numbers with 

\[ x_0 + y_0 \le 4\sqrt{\tfrac37}\, a^{-2} \quad \text{and} \quad x_0 y_0 = \frac{4}{a}. \]

Put $\theta = 6(\sqrt2 - 1)/7$. 
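
The role of the threshold can be illustrated numerically. The sketch below rests on an assumption about the two defining bounds, namely that they read $x_0 + y_0 \le \sqrt{14/3}\, a + 4\sqrt{3/14}\, a^{-2}$ for $K_1$ and $x_0 + y_0 \le 4\sqrt{3/7}\, a^{-2}$ for $K_2$. Under that reading the value $6(\sqrt2 - 1)/7$ is exactly the value of $a^3$ at which the two bounds coincide. The function names are illustrative.

```python
import math

THETA = 6 * (math.sqrt(2) - 1) / 7  # threshold value defined in the text


def k1_sum_bound(a):
    """Assumed K_1 bound on x_0 + y_0: sqrt(14/3) * a + 4 * sqrt(3/14) / a^2."""
    return math.sqrt(14 / 3) * a + 4 * math.sqrt(3 / 14) / a ** 2


def k2_sum_bound(a):
    """Assumed K_2 bound on x_0 + y_0: 4 * sqrt(3/7) / a^2."""
    return 4 * math.sqrt(3 / 7) / a ** 2


# At a^3 = THETA the two bounds agree (up to rounding).
a_theta = THETA ** (1 / 3)
gap = k1_sum_bound(a_theta) - k2_sum_bound(a_theta)

if __name__ == "__main__":
    print(THETA, gap)
```

For $a^3$ above the threshold the $K_1$ bound is the larger of the two, and below it the $K_2$ bound is, which fits the way the two lemmas split into the cases $a^3 > \theta$ and $a^3 \le \theta$.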

Lemma 1 Let L be a lattice in the plane with d(L) = 4/a and such that no 
two points of L are less than a distance $\sqrt{14/3}\, a$ apart. Then there exist positive 
numbers ξ, η such that the parallelogram Π given by $|x|\eta + |y|\xi < 4/a = \xi\eta$ 
is L-admissible and lies in $K_1$ if $a^3 > \theta$, in $K_2$ if $a^3 \le \theta$. 

Let S denote the set of points (x, y) such that there exists a half-open 
interval I of length a for which t ∈ I implies 

(1) |(x + t)(y + t)(t − a)| < 1 and (2) t − a < 0. 

Similarly, let S' denote the set of points (x, y) such that there exists a half-open 
interval I of length a for which t ∈ I − {0} implies (1) and (2). 

Lemma 2 (a) If $a^3 > \theta$, then $K_1 \subset S$ and $B(K_1) \subset S'$. 

(b) If $a^3 \le \theta$, then $K_2 \subset S$ and $B(K_2) \subset S'$. 


3. Proof of the Theorem 

Before proving these lemmas we show how the theorem follows. We first 
restate the theorem in a form convenient for our purpose. Let Λ be a lattice 
in $\mathbb{R}^3$ with d(Λ) = 4. Again assume that the homogeneous minimum m(Λ) is 
attained and nonzero. 

Theorem Given any real numbers $x_0, y_0, z_0$, there exists a point (x, y, z) 
of Λ such that 

\[ |(x + x_0)(y + y_0)(z + z_0)| < 1 \quad \text{and} \quad z + z_0 < 0. \]

Proof Let X ∈ Λ be such that m(Λ) is attained at ±X. Applying a map 
of the form $x \to \lambda x$, $y \to \mu y$, $z \to |\lambda\mu|^{-1} z$, where λ ≠ 0, μ ≠ 0, it is clear that 
we may assume either X or −X is the point (a, a, a) where a > 0. By a 
well-known result of Davenport [3] it follows that (i) $a^3 \le \tfrac47$ and (ii) the 
projection L of Λ into the plane z = 0 parallel to the vector (a, a, a) is a 
lattice of determinant 4/a with no two of its points less than $\sqrt{14/3}\, a$ apart. By 
Lemma 1 there exist positive numbers ξ, η such that the parallelogram Π 
given by $|x|\eta + |y|\xi < 4/a = \xi\eta$ is L-admissible. By a classical result of 
Minkowski, any coplanar translates of Π and L either have a common point 
or their boundaries have at least two common points. The projection of 
$\Lambda + (x_0, y_0, z_0)$ onto the plane z = −a parallel to the vector (a, a, a) yields 
a translate of L. Hence, either there exists a point (x*, y*, −a) of this translate 
such that (x*, y*) ∈ Π, or there exist two distinct points $(x_1^*, y_1^*, -a)$, 
$(x_2^*, y_2^*, -a)$ of this translate such that $(x_1^*, y_1^*)$ and $(x_2^*, y_2^*)$ are on the 
boundary of Π. In the first case Lemma 2 implies that (x*, y*) ∈ S and, since 
(a, a, a) ∈ Λ, the theorem is true. In the second case $(x_1^*, y_1^*) \in S'$ and 
$(x_2^*, y_2^*) \in S'$. By similar reasoning to that used in the first case, either the 
theorem is true or $(x_1^*, y_1^*, -a)$ and $(x_2^*, y_2^*, -a)$ both lie in 
$\Lambda + (x_0, y_0, z_0)$, which would immediately imply that m(Λ) = 0, the case we 
have excluded. This completes the proof of the theorem. 

4. Proof of Lemma 1 

Let c be the largest positive number such that the parallelogram P given 
by |x| + |y| < c is L-admissible. 

Case 1 $c \ge 2/\sqrt a$. Let Π be the parallelogram obtained by putting 
$\xi = \eta = 2/\sqrt a$ so that Π is L-admissible and ξη = 4/a. Then 

\[ \xi + \eta = \frac{4}{\sqrt a} \le \sqrt{\tfrac{14}{3}}\, a + 4\sqrt{\tfrac{3}{14}}\, a^{-2} \]

since $(a^{3/2} - 2\sqrt{3/14})^2 \ge 0$. Thus Π lies in $K_1$. If $a^3 \le \theta$, then, since $\theta < \tfrac37$, 

\[ 4/\sqrt a < 4\sqrt{\tfrac37}\, a^{-2} \]

and Π lies in $K_2$, which completes this case. 

Case 2 $c < 2/\sqrt a$. Denote by (x*, y*) one of the points of L that necessarily 
lie on the boundary of P. The symmetry of the lemma allows us to 
suppose that y* ≥ x* ≥ 0. By hypothesis 

\[ x^{*2} + y^{*2} \ge \tfrac{14}{3}\, a^2. \]

Let ξ, η be positive numbers such that 

(i) ξη = 4/a, 

(ii) ξ ≥ η, and 

(iii) the parallelogram Π given by |x|η + |y|ξ < 4/a contains (x*, y*) 
on its boundary. 

The existence of such a pair ξ, η follows from the fact that P contains 
(x*, y*) on its boundary and $c < 2/\sqrt a$. We claim that this choice of ξ, η 
satisfies the conclusion of Lemma 1. In the first place, Π is L-admissible, for 
otherwise Π contains a point of L linearly independent of (x*, y*) in its 
interior, which implies that d(L) < 4/a, a contradiction. Assume first that 
$a^3 > \theta$ and, by way of contradiction, that 

\[ \xi + \eta > \sqrt{\tfrac{14}{3}}\, a + 4\sqrt{\tfrac{3}{14}}\, a^{-2}. \]

Since ξη = 4/a, $\sqrt{14/3}\, a \cdot 4\sqrt{3/14}\, a^{-2} = 4/a$ and $4\sqrt{3/14}\, a^{-2} > \sqrt{14/3}\, a$, it follows 
that $\xi > 4\sqrt{3/14}\, a^{-2}$ and $\eta < \sqrt{14/3}\, a$. 

The line xη + yξ = 4/a meets the line x = y in the point 
(4/a(ξ + η), 4/a(ξ + η)) whose distance from O is 

\[ \frac{4\sqrt2}{a(\xi + \eta)} < \frac{4\sqrt2}{a \bigl( \sqrt{14/3}\, a + 4\sqrt{3/14}\, a^{-2} \bigr)} \le \sqrt{\tfrac{14}{3}}\, a \]

since $\theta \le a^3$. Hence the line xη + yξ = 4/a meets the axis x = 0 and the line 
x = y in points both less than a distance $\sqrt{14/3}\, a$ from O. Since y* ≥ x*, this 
does not permit (x*, y*) to lie on the boundary of Π, a contradiction. Therefore 
we may assume $a^3 \le \theta$ and, by way of contradiction, that 

\[ \xi + \eta > 4\sqrt{\tfrac37}\, a^{-2}. \]

Choose ξ*, η* so that, in particular, ξ*η* = 4/a, ξ* ≥ η*, and $\xi^* + \eta^* = 4\sqrt{3/7}\, a^{-2}$. The line 
xη* + yξ* = 4/a meets the line x = y in the point $(\sqrt{7/3}\, a, \sqrt{7/3}\, a)$ which is at 
distance $\sqrt{14/3}\, a$ from O. The same line meets x = 0 in the point (0, 4/aξ*) 
whose distance from O is 

\[ \frac{4}{a\xi^*} = \frac{2a}{\sqrt{3/7} + \sqrt{3/7 - a^3}} \le \sqrt{\tfrac{14}{3}}\, a \]

since $a^3 \le \theta$. By the initial assumption, ξ > ξ* and η < η* and so the line 
xη + yξ = 4/a meets x = 0 and x = y in points both less than a distance 
$\sqrt{14/3}\, a$ from O. Again this implies that (x*, y*) cannot lie on the boundary of 
Π, the contradiction that proves Lemma 1. 

5. Proof of Lemma 2 

Let (x, y) ∈ CL($K_1$) if $a^3 > \theta$, ∈ CL($K_2$) if $a^3 \le \theta$. Let $x_0$, $y_0$ be a pair of 
numbers that can be associated with x, y in the definition of $K_1$ and $K_2$ so 
that $|x|\, y_0 + |y|\, x_0 \le 4/a = x_0 y_0$ and, therefore, by the arithmetic-geometric 
mean inequality 

\[ |xy| \le \frac{1}{a} \tag{i} \]

with strict inequality unless (x, y) is not in either $K_1$ or $K_2$. Thus, if 
|xya| = 1, the hypothesis of the lemma allows us to exclude the value t = 0. 
We shall make repeated use of this fact in what follows without expressly 
calling attention to it. From the symmetry in x, y we may assume |x| ≤ 
|y|. Further, $|y| \le 4/a x_0 = y_0$, so that an upper bound for $y_0$ is also an 
upper bound for |y|. If (x, y) ∈ CL($K_1$), then 

\[ y_0 \le 4\sqrt{\tfrac{3}{14}}\, a^{-2} = (1.8516\ldots)\, a^{-2}. \]

On the other hand, if (x, y) ∈ CL($K_2$), then 

\[ y_0 \le \bigl( 2\sqrt{\tfrac37} + 2\sqrt{\tfrac37 - a^3} \bigr) a^{-2} \le 4\sqrt{\tfrac37}\, a^{-2} = (2.6186\ldots)\, a^{-2}. \]

Thus, in either case, $y_0 \le 4\sqrt{3/7}\, a^{-2}$. 
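
The two decimal constants printed above are what identify the radicals in these bounds. A minimal check, assuming the coefficients are $4\sqrt{3/14}$ and $4\sqrt{3/7}$ (the variable names are illustrative):

```python
import math

# Assumed closed forms for the coefficients of a^(-2) in the two bounds on y_0.
K1_COEFF = 4 * math.sqrt(3 / 14)  # printed in the text as 1.8516...
K2_COEFF = 4 * math.sqrt(3 / 7)   # printed in the text as 2.6186...

if __name__ == "__main__":
    print(f"{K1_COEFF:.6f} {K2_COEFF:.6f}")
```

Both values agree with the printed decimals to the four places given, and the first is smaller than the second, so the bound for $K_2$ covers both cases.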
Case A x ≤ 0. Put f(t) = (x + t)(y + t)(t − a). We shall show that if 
0 < t < a, then |f(t)| < 1. We distinguish cases: 

(i) x ≤ −a. In f(t) the coefficient of t is xy − ay − ax = c(t), say. If 
y ≤ 0, then c(t) ≥ 0 since all terms are nonnegative. If, however, y > 0, then 
c(t) ≤ 0 since |y| ≥ |x|. Hence f(t) is monotone on [0, a]. Since |f(0)| ≤ 1 
and f(a) = 0, the result follows. 

(ii) −a ≤ x ≤ 0 and y ≥ 0. Again c(t) ≤ 0 so that f(t) is monotone 
decreasing on [0, −x]. Since |f(0)| ≤ 1 and f(−x) = 0 it follows that 
|f(t)| < 1 on (0, −x]. On the other hand, if t ∈ [−x, a), then 

\[ |f(t)| = |(t + x)(t + y)(t - a)| \le \tfrac14 a^2 (|y| + a) < 1. \]

(iii) −a ≤ x ≤ 0 and y ≤ 0. Again c(t) ≥ 0 so f(t) is monotone increasing 
on [0, −x] and, as before, |f(t)| < 1 when t ∈ (0, −x]. If 
t ∈ [−x, a), we repeat the argument under (ii) to obtain |f(t)| < 1. 

Case B x ≥ a. f(t) has a single extremum in [−x, a] and therefore is 
monotone on at least one of the two intervals [−x, 0] or [0, a], and so the 
result follows as before. 

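We must now distinguish between the cases $a^3 > \theta$ and $a^3 \le \theta$. Assume 
for the time being that $a^3 > \theta$ so that, in particular, 

\[ y_0 \le 4\sqrt{\tfrac{3}{14}}\, a^{-2}. \]

Case C $0 \le x \le \tfrac12 a$. For 0 < t < a we have 

\[
\begin{aligned}
|(x + t)(y + t)(t - a)| &= |y_0 (x + t)\, x_0 (y + t)(t - a) / x_0 y_0| \\
&\le \bigl( y_0 (|x| + t) + x_0 (|y| + t) \bigr)^2\, \tfrac{1}{16}\, a (a - t) \\
&\le \bigl( a^{-1} + \tfrac14 t\, ( \sqrt{14/3}\, a + 4\sqrt{3/14}\, a^{-2} ) \bigr)^2 a (a - t),
\end{aligned}
\]

and, putting t = λa, so that 0 < λ < 1, this is 

\[ \bigl( 1 + \tfrac14 \lambda ( \sqrt{14/3}\, a^3 + 4\sqrt{3/14} ) \bigr)^2 (1 - \lambda) \le (1 + s\lambda)^2 (1 - \lambda) = g(\lambda), \text{ say,} \]

where $s = \tfrac17 \sqrt{14/3} + \sqrt{3/14} < .772$. Then g'(λ) = (1 + sλ)(2s − 1 − 3sλ) < 0 if 
$\tfrac12 \le \lambda < 1$. Since g(1) = 0 and $g(\tfrac12) < 1$, it follows that when $\tfrac12 a \le t < a$ we do 
have 

\[ |(x + t)(y + t)(t - a)| < 1 \]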

as desired. Now suppose $0 < t < \tfrac12 a$, so that $0 < \lambda < \tfrac12$. Put x = μa so that 
$0 \le \mu \le \tfrac12$. Then 

\[ |(x + t)(a - t)| = a^2 (\lambda + \mu)(1 - \lambda) \le \tfrac14 a^2 (1 + \mu)^2, \]

whereas 

\[ |y| \le \frac{x_0 y_0 - |x|\, y_0}{x_0} = y_0 - \tfrac14 \mu a^2 y_0^2. \]

Hence 

\[
\begin{aligned}
|(x + t)(y + t)(t - a)| &\le \tfrac14 (1 + \mu)^2 ( a^2 y_0 - \tfrac14 \mu a^4 y_0^2 + a^3 \lambda ) \\
&< \tfrac14 (1 + \mu)^2 ( a^2 y_0 - \tfrac14 \mu a^4 y_0^2 + \tfrac12 a^3 ) \\
&\le \tfrac14 (1 + \mu)^2 ( a^2 y_0 - \tfrac14 \mu (a^2 y_0)^2 + \tfrac27 ).
\end{aligned}
\]

But $a^2 y_0 \le 4\sqrt{3/14}$ and, as a function of z, $z - \tfrac14 \mu z^2$ has a maximum at 
$z = 2/\mu > 4\sqrt{3/14}$. Therefore 

\[ |(x + t)(y + t)(t - a)| < \tfrac14 (1 + \mu)^2 ( 4\sqrt{3/14} + \tfrac27 - \tfrac67 \mu ) = h(\mu), \text{ say.} \]

Computing, we find that h'(μ) > 0 if $0 \le \mu \le \tfrac12$ and also $h(\tfrac12) < 1$. This completes 
Case C. 

Case D $\tfrac12 a < x \le \tfrac34 a$. Put x = μa, so that $\tfrac12 < \mu \le \tfrac34$. Assume first that 
y ≤ 0. We shall show that the interval 0 < t < a will suffice. To begin, 

\[ (x + t)(a - t) \le \tfrac14 (a + x)^2. \]

Thus, if y ≥ −a, then 

\[ |(x + t)(y + t)(t - a)| \le \tfrac14 a (a + x)^2 < a^3 < 1. \]

On the other hand, if y < −a, then 

\[
\begin{aligned}
|(x + t)(y + t)(t - a)| &\le \tfrac14 |y| (a + x)^2 \\
&\le \tfrac14 (1 + \mu)^2 ( a^2 y_0 - \tfrac14 \mu (a^2 y_0)^2 ) \\
&\le \tfrac14 (1 + \mu)^2 ( 4\sqrt{3/14} - \tfrac14 \mu (4\sqrt{3/14})^2 ) = i(\mu), \text{ say.}
\end{aligned}
\]

Computing as before, i'(μ) is positive and so i(μ) is monotone increasing in 
$[\tfrac12, \tfrac34]$ and $i(\tfrac34) < 1$, which takes care of y ≤ 0. Now assume that y ≥ 0. We 
show that the interval $(-\tfrac14 - \mu) a \le t \le (\tfrac34 - \mu) a$ will suffice. Put t = λa so 
that $-\tfrac14 \le \lambda + \mu \le \tfrac34$. Suppose that $0 \le \lambda + \mu \le \tfrac34$. Then with 

\[ f(t) = |(x + t)(y + t)(t - a)| = (\lambda + \mu)(1 - \lambda)\, |a^2 y + a^3 \lambda|, \]

if $a^2 y + a^3 \lambda \le 0$ it is clear that f(t) < 1, so we may assume that 
$a^2 y + a^3 \lambda > 0$ and so 

\[
\begin{aligned}
f(t) &\le (\lambda + \mu)(1 - \lambda) ( 4\sqrt{3/14} - \tfrac67 \mu + a^3 (\tfrac34 - \mu) ) \\
&\le (\lambda + \mu)(1 - \lambda) ( 4\sqrt{3/14} + \tfrac37 - \tfrac{10}{7} \mu ) = j(\mu), \text{ say.}
\end{aligned}
\]

Computing as before, we find that j'(μ) is positive in the range in question 
and so the maximum value of j(μ) occurs when $\lambda + \mu = \tfrac34$. Hence 

\[ f(t) \le \tfrac34 (1 - \lambda) ( 4\sqrt{3/14} - \tfrac{9}{14} + \tfrac{10}{7} \lambda ) = k(\lambda), \text{ say.} \]

Further 

\[ k(\lambda) = \tfrac34 (1 - \lambda)(\alpha + \beta\lambda), \quad \text{where } \alpha = 4\sqrt{3/14} - \tfrac{9}{14} \text{ and } \beta = \tfrac{10}{7}, \]

has a maximum value of 

\[ 3(\alpha + \beta)^2 / 16\beta < 1, \]

and so f(t) < 1 when $0 \le \lambda + \mu \le \tfrac34$. Now suppose that $-\tfrac14 \le \lambda + \mu \le 0$. 
Then f(t) is not larger than one of the two expressions 

\[ |(\lambda + \mu)(1 - \lambda)\, a^2 y| \quad \text{and} \quad |(\lambda + \mu)(1 - \lambda)\, a^3 \lambda|, \]

both of which are less than 1 since $a^2 y \le 4\sqrt{3/14} - \tfrac14 \mu (4\sqrt{3/14})^2$ and |λ| < 1. 
This clears Case D. 

Case E $\tfrac34 < \mu < 1$. Assume first that y ≥ 0. Since f(t) has zeros at a 
and −x and has no zero between, it follows that either the interval 0 < t < a 
will suffice or f(t) < 1 on the interval −x < t < 0. But if −a < t < −x, then 
f(t) is not larger than one of the two expressions 

|(x + t)(a − t)y| and |(x + t)(a − t)a| 

both of which are less than 1 since $a^2 y < 2$. 

Now suppose that y < 0. As in Case D, if y ≥ −a, then the interval 
0 < t < a will work. Hence we may assume that y < −a. Again either the 
interval 0 < t < a suffices or f(t) < 1 for −x < t < 0. Thus suppose that 
$-a < t < -x < -\tfrac34 a$. Then $|x + t| < \tfrac14 a$, |a − t| < 2a and so 

\[ f(t) < \tfrac12 ( a^2 |y| + a^3 ) \le \tfrac12 ( 4\sqrt{3/14} - \tfrac67 \mu + \tfrac47 ) < 1. \]

This completes Case E and we have now covered all the possibilities when 
$a^3 > \theta$. Thus, assume from now on that $a^3 \le \theta$. 

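Case C $0 \le x \le \tfrac12 a$. Set $y^* = 2\sqrt{3/7} + 2\sqrt{3/7 - s}$ and let 0 ≤ μ, |λ| ≤ 1 
be constants. 

Auxiliary Lemma If $\lambda + 2\mu \le \sqrt{7/3}$, then the maximum value of g(s) = 
$y^* - \tfrac14 \mu y^{*2} + s\lambda$ is at s = 0 in the range $[0, \tfrac37]$. 

Proof $g'(s) = (\tfrac37 - s)^{-1/2} ( \lambda \sqrt{\tfrac37 - s} - 1 + \tfrac12 \mu y^* )$. By hypothesis 
$\lambda \le (1 - 2\mu\sqrt{3/7}) (\tfrac37)^{-1/2} \le (1 - \tfrac12 \mu y^*) (\tfrac37 - s)^{-1/2}$, 
hence $\lambda (\tfrac37 - s)^{1/2} - 1 + \tfrac12 \mu y^* \le 0$ and g'(s) ≤ 0. This proves the lemma. 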

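Assume that 0 < t < a. Then 

\[
\begin{aligned}
f(t) &= |y_0 (x + t)\, x_0 (y + t)(a - t) / x_0 y_0| \\
&\le ( \tfrac12 y_0 |x + t| + \tfrac12 x_0 |y + t| )^2\, |a - t|\, \tfrac14 a \\
&\le ( a^{-1} + t \sqrt{3/7}\, a^{-2} )^2\, |a - t|\, a.
\end{aligned}
\]

Put t = λa so that 0 < λ < 1 and 

\[ f(t) \le (1 + \sqrt{3/7}\, \lambda)^2 (1 - \lambda) = n(\lambda), \text{ say.} \]

Computing in the usual way, we find that n'(λ) < 0 if $\lambda \ge \tfrac13$. However, 
n(1) = 0 and $n(\tfrac13) = (1 + 21^{-1/2})^2\, \tfrac23 < 1$. Hence f(t) < 1 if $\tfrac13 \le \lambda < 1$. So 
assume $0 < \lambda < \tfrac13$. Since with x = μa, so $0 \le \mu \le \tfrac12$, it follows that 
$\lambda + 2\mu < \tfrac43 < \sqrt{7/3}$. Thus the auxiliary lemma applies and 

\[
\begin{aligned}
f(t) &\le (\lambda + \mu)(1 - \lambda)( a^2 |y| + \lambda a^3 ) \\
&\le (\lambda + \mu)(1 - \lambda)( a^2 y_0 - \tfrac14 \mu (a^2 y_0)^2 + \lambda a^3 ).
\end{aligned}
\]

Since $a^3 \le \theta$, so $y_0 a^2 \le 2\sqrt{3/7} + 2\sqrt{3/7 - a^3} = y^*$. Recalling that $z - \tfrac14 \mu z^2$ is 
monotone increasing for z ≤ 2/μ and 2/μ ≥ 4 > y*, so 

\[
\begin{aligned}
f(t) &\le (\lambda + \mu)(1 - \lambda)( y^* - \tfrac14 \mu y^{*2} + \lambda a^3 ) \\
&\le (\lambda + \mu)(1 - \lambda)( 4\sqrt{3/7} - \tfrac{12}{7} \mu ) \quad \text{by the auxiliary lemma,} \\
&\le \tfrac14 (1 + \mu)^2 ( 4\sqrt{3/7} - \tfrac{12}{7} \mu )
\end{aligned}
\]

by the arithmetic-geometric mean inequality, = p(μ), say. The customary 
computation shows that p'(μ) > 0 since $\mu \le \tfrac12$ and so $p(\mu) \le p(\tfrac12) < 1$. This 
completes Case C. 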

Case D < x < ,694a Assume first that y> 0. Again put x — pa and 
t = Aa so that \ < p < .694. We show that the interval for f given by 
— .306 — p < A < .694 — p will suffice. We have 

f(t) = | (x + f)(y + t)(a - t) | = | (A + p) | (1 - A)(a 2 y + a 3 A). 

Here we may assume that a 2 y -I- a 3 A > 0 for otherwise, since | A | ^ 1 and 
|A + fi \ ^ .694, it follows easily that f(t) < 1. Further, with y* = 2^/f + 
2^/4 — a 3 , the previous argument applies because 2//i ^ 2/.694 > 4^/f. 
Therefore 

/(f) < | (A + p) | (1 - A)(y* - ipy* 2 + a 3 A). 

But A + 2p < .694 + .694 < v /j and the auxiliary lemma applies. Hence 

/(f)< |(A + / i)|(l-A)(4 v /4-¥^). 

Suppose first that 0 ^ A + ^ .694, so that 

| A + fi\(\ — A) = (A + /i)(l — A) 

which, as a function of A has a maximum at A = ^(1 — /*). However, A is 
always less than this value since 

2A + n ^ .694 + (.694 - .5) < 1. 

Hence we are justified in replacing A + /i by .694 and so 

/(f) .694(306 + p){4^ - ¥ff) = <?(/*), say. 
The customary computation of the maximum of q(fi) yields 
q(n) < .694(1 4- .306V1) 2 = .999899... < 1. 

Suppose now that — .306 < 1 + /i < 0. Then 

|A + /f|(l-A) = -(A + /r)(l-A), 
and, since A < 0, this is a maximum when A = — .306 — fi. Therefore 
f(t) < .306(1.306 + //)(4 V /| - Wi) = r(n), say. 
r(fi) has its maximum at fi = — .653 < Since fi ^ j it follows that 

r(n) < . 306 ( 1 . 806 )( 4 x /7 - $) < 1 . 

Thus the lemma holds in this case if y ^ 0. 

Now assume that y < 0. Again we may assume that y < −a for otherwise, 
taking 0 < t < a, we have f(t) < 2a • a • a < 4 < 1. Then f(t) has roots 
−x, a, and −y > a and so either 0 < t < a will suffice or f(t) < 1 on 
−x < t < 0. Hence we may assume that f(t) < 1 on −μ < λ < 0. In addition, 
if 0 < λ < .694 − μ, the argument for y > 0 above goes through virtually 
unchanged since if a²y + a³λ is positive then an upper bound is .194a³ 
and /(f) < 1, whereas if a 2 y + a 3 A is nonpositive, then its modulus is 
bounded above by a 2 \ y | — ifiy* 2 and the auxiliary lemma may be 

applied with A = 0 yielding the same bound as before. Hence we may assume 
that —.306 < A 4- fi < 0. It is then evident that, as a function of A,/(f) is a 
maximum when A 4- /i = — .306. Hence, incorporating previous arguments 

/(f) < .306(1.306 + n)(y* — ifiy* 2 + a 3 (ju + .306)) 
where y* = 2^/f 4- 2-J^j — a 3 . 

Put u(a 3 ) = y* — {ny* 2 + a 3 (fi 4- .306) and, for simplicity of notation, 
a = ^ and ft = .306. Then u(a 3 ), considered as a function of a 3 , attains its 
maximum at 



and with this value of a 3 , after a little more computation, 

y* — iny* 2 + u 3 (/f + -306) = (/fa 4- a/? 4 - l) 2 /(2/f 4- /?). 

Hence 

/(t) P(l + /} + fi)(fix + x/1 + l) 2 /(2/i 4- P) = v(fi) 
say. Now v'(/i) has the same sign as 

(fix + xp + 1 4- 2a(l + p + fi))(2fi + P) — 2(1 4-/4- fi)(fix 4- a)3 4- 1) 
which is a quadratic in fi with leading coefficient 4a > 0. Further, at fi = 0 its 
value is <xp 2 — p — 2 < 0 since a = and p = .306. It follows that the 
maximum value of v(/i) on \ < p < .694 is 

max(r(i), t;(.694)) = max(,987..., .989...) < 1. 

This completes Case D. 

Case E .694 a < x < a Assume first that y > 0. We show that the 
interval — a < t < 0 will suffice. IfO < y < a and —a < t < 0, it is trivial that 
/(f) < 1. Thus we assume that y > a. As before,/(t) has roots -x, a, and 

— y<— a<— x and so either /(f) < 1 on 0<t<a or /(f) <1 on 
—x < f < 0. Thus we need to show only that /(f) < 1 on — a < t -x. 
Again put t = Xa and x = pa so that -1 < X -p, and .694 < p < 1. Now 

/(f) = \X + fi\(a 2 y + a 3 2)(l - X). 

But a 2 y + a 3 X < a 2 y < a 2 y 0 — i(a 2 y 0 ) 2 < 1/p, the maximum value in z of 
z — ifiz 2 , < 1/.694. Hence 

/(f) < .306(2)(l/.694) < 1. 

Therefore we may assume from now on that y < 0. Exactly as before we may 
assume that y < -a so that either /(f) < 1 on 0 < f < a or /(f) < 1 or 

— x < t < 0. Hence it remains to show that f(t)< 1 on — a < t < — x, i.e., 
on — 1 < X < —p. In this case 

/(f) « 2(1 - p)(a 2 |y| + a 3 ) < 2(1 - p)(a 2 y 0 - ip(a 2 y 0 ) 2 + a 3 ). 
Suppose first that .694 < p < .76. Since 

2/p S* 2/.76 > 4/i 

so /( f ) ^ 2(1 - p)(y* — \ny* 2 + a 3 ) and clearly this is greatest when p is 
least, namely .694. However, the case X = — 1, p = .694, y < 0 has already 
been considered, and we have shown here that/(f) < 1. Hence it remains to 
consider the possibility that .76 < p < 1. Let -a < t < 0. Arguing as before, 
it is sufficient to consider -a < t < -x, i.e., -1 < X < — p and then 

/(f) < |A + p||l-A|(a 2 |y| - a 3 l) 

^ | A + p| 11 _ A| (i + « 3 ) < (2A)(2)[~ + ^) < 1, 

and this completes the proof of Lemma 2. 
If $\alpha + \beta$ is not a generator of the composite field $K(\alpha, \beta)$, where $K(\alpha)$ and $K(\beta)$ are linearly disjoint 
separable algebraic extensions of the field $K$ of degrees $m$ and $n$, then char $K$ is a divisor of $mn$ or it 
is smaller than $\tfrac12 \min(m, n)$. If $\alpha\beta$ does not generate $K(\alpha, \beta)$, then both fields $K(\alpha)$ and $K(\beta)$ are, 
essentially, radical extensions of $K$. 

Let $K$ be a field and let $K(x)$ and $K(y)$ be simple separable algebraic 
extensions of $K$ of finite degrees $m$ and $n$, say, such that $[K(x, y):K] = mn$. 
The composite field $K(x, y)$ is also a simple extension of $K$, and experience 
tells us that it is usually generated by $x + y$. This has been confirmed by 
Isaacs [1] for char $K = 0$, or char $K = p \nmid mn$ and $p > \min(m, n)$. Another 
proof of a weaker result in a more general context has recently been given in 
[2]. 

The purpose of this article is twofold. Firstly, we shall show that 
$K(x, y) = K(x + y)$ already if $p \nmid mn$ and $p > \tfrac12 \min(m, n)$. Secondly, we shall 
study the analogous question for the products and give a complete description 
of the extensions in which $xy$ does not generate $K(x, y)$. They turn out 
to be, essentially, radical extensions. 

† Deceased. 
Let us assume that $[K(x, y):K(x + y)] = k > 1$. The five fields $K$, $K(x)$, 
$K(y)$, $K(x + y)$, and $K(x, y)$ will be said to form a deficient system (with 
deficiency $k$). Let us denote by $x = x_1, x_2, \ldots, x_m$ and $y = y_1, y_2, \ldots, y_n$ the 
conjugates of $x$ and $y$. Let us denote by $K_x$, $K_y$, and $K_{x,y}$ the normal 
closures of $K(x)$, $K(y)$, and $K(x, y)$ over $K$. 

By our assumption, all the Galois groups $\mathrm{Gal}(K_y(x_i)/K(x_i))$ $(1 \le i \le m)$ 
are transitive on the $y$'s and the groups $\mathrm{Gal}(K_x(y_j)/K(y_j))$ $(1 \le j \le n)$ are 
transitive on the $x$'s. Therefore, $\mathrm{Gal}(K_{x,y}/K)$ acts transitively on the sums 
$x_i + y_j$ $(1 \le i \le m,\ 1 \le j \le n)$. Since we assume that $\deg(x + y) = mn/k$, we 
must have a system of $mn/k$ equations of the type 

\[ x_{i_1} + y_{j_1} = x_{i_2} + y_{j_2} = \cdots = x_{i_k} + y_{j_k}, \tag{1} \]

where the sums $x_i + y_j$ in different equations are distinct and the group 
$\mathrm{Gal}(K_{x,y}/K)$ acts on the equations transitively. 

Two equations of the type (1) will be called directly linked (with respect 
to $x$) if there is an $x_i$ that appears in both of them. Two such equations will be 
called linked (with respect to $x$) if there is a chain of directly linked equations 
connecting them. Contrary to direct linkage, the concept of linkage is an 
equivalence relation and our equations will be partitioned into equivalence 
classes. An equivalence class containing $x_i$, say, in one of the equations will 
contain all equations containing $x_i$. Therefore, the class that contains $x = x_1$ 
will contain all equations in which occur some of the conjugates $x_1, x_2, \ldots, 
x_{m_1}$, say. By the transitivity of $\mathrm{Gal}(K_{x,y}/K)$ on the equivalence classes, all 
classes will likewise contain $m_1$ conjugates of $x$, and thus $m_1 \mid m$. Every 
transformation from $\mathrm{Gal}(K_{x,y}/K)$ that carries over $x_i$ $(1 \le i \le m_1)$ into $x_j$ 
$(1 \le j \le m_1)$ fixes the whole equivalence class containing $x_1$ and thus it 
permutes the elements $x_1, x_2, \ldots, x_{m_1}$ and fixes the field 

\[ K' = K\Bigl(\sum_{i=1}^{m_1} x_i,\ \sum_{i<j} x_i x_j,\ \ldots,\ x_1 x_2 \cdots x_{m_1}\Bigr). \]

Therefore, $K' \subset K(x)$ and the elements $x_1, x_2, \ldots, x_{m_1}$ are conjugate with 
respect to $K'$ and their degree is $m_1 \mid m$. Furthermore, we have $K(x) = K'(x)$, 
$K(x, y) = K'(x, y)$, and $K(y) \subset K'(y) \subset K'(x, y)$. It follows that 

\[ [K'(x, y):K'(y)] = [K'(x):K'] = m_1 \]

and 

\[ [K'(x, y):K'(x)] = [K'(y):K'] = n. \]

Moreover, $[K'(x, y):K'(x + y)] = k$ since the equations (1) are unchanged. 

In this way we have obtained the same situation over the ground field $K'$ 
as we had over $K$, namely the deficiency $k$ by which the degree of $x + y$ has 
to be divided is unchanged. Now we shall perform the same kind of reduction 
over $K'$ with respect to $y$. If it happens that we destroy the linkage with 
respect to $x$, we shall perform the reduction with respect to $x$ again, and if 
necessary, once more with respect to $y$, etc. After finitely many steps, we shall 
have a ground field $K^* \subset K(x, y)$ such that 

\[ [K^*(x, y):K^*(x)] = [K^*(y):K^*] = n^* \mid n, \]

\[ [K^*(x, y):K^*(y)] = [K^*(x):K^*] = m^* \mid m, \]

\[ [K^*(x, y):K^*(x + y)] = k > 1, \]

and all equations (1) for the sums $x_i + y_j$ $(1 \le i \le m^*,\ 1 \le j \le n^*)$ will be 
linked both with respect to $x$ and $y$. The five fields $K^*$, $K^*(x)$, $K^*(y)$, 
$K^*(x + y)$, and $K^*(x, y)$ will be said to form a primitive deficient system 
(with parameters $p$, $m^*$, $n^*$, and $k$). In the following we shall restrict 
ourselves to primitive systems only. 

Remark If K, K(x), K(y), K(x + y), and K(x, y) form a primitive 
system, there still can be a nontrivial subfield between K and K(x) or K(y). 

Theorem 1 Let $K$, $K(x)$, $K(y)$, $K(x + y)$, and $K(x, y)$ form a primitive 
system with parameters $p = \operatorname{char} K$, $m = [K(x):K]$, $n = [K(y):K]$, and 
$k = [K(x, y):K(x + y)] > 1$. If $p \nmid mn/k$, then $K_x = K_y = K_{x,y}$, and if $p \nmid m$, 
then $K_x \subseteq K_y$, i.e., $m \mid (n - 1)!$. 

Theorem 2 If $K$, $K(x)$, $K(y)$, $K(x + y)$, and $K(x, y)$ form a primitive 
system of characteristic $p$, then 

\[ p \mid [K_{x,y}:K]. \]

Proof of Theorem 1 Primitive systems have one important property, 
namely every $x_i$ $(1 \le i \le m)$ can be written in the form 

\[ x_i = x_1 + \sum_{j=1}^{n} \lambda_{ij} y_j, \qquad \sum_{j=1}^{n} \lambda_{ij} = 0 \qquad (1 \le i \le m) \tag{2} \]

and every $y_j$ $(1 \le j \le n)$ can be written as 

\[ y_j = y_1 + \sum_{i=1}^{m} \mu_{ij} x_i, \qquad \sum_{i=1}^{m} \mu_{ij} = 0 \qquad (1 \le j \le n) \tag{3} \]

where the $\lambda$'s and $\mu$'s are rational integers. This is a consequence of the 
two-sided linkage. Note that the representations (2) and (3) may not be 
unique. Using (2) we see that 

\[ \operatorname{tr}(x + y) = (mn/k)\,x_1 + Y, \]

where $Y \in K_y$. If $p \nmid mn/k$, then it follows that $x_1 \in K_y$ and this implies 
$K_x \subseteq K_y$. If we use Eqs. (3), we similarly obtain $K_y \subseteq K_x$. If $p \nmid m$, then (2) 
implies $\operatorname{tr} x = m x_1 + Y$, $Y \in K_y$, and that yields $x_1 \in K_y$, which in turn 
implies $K_x \subseteq K_y$. 

Proof of Theorem 2 Without loss of generality we may assume that 
$p \nmid mn$, and hence we may assume that $\operatorname{tr} x = \operatorname{tr} y = 0$. By (2), we have 

\[ x_1 = \sum_{j=1}^{n} \nu_j y_j, \qquad \sum_{j=1}^{n} \nu_j = 0, \qquad \nu_j \in K \quad (1 \le j \le n). \tag{4} \]

For every $\varphi \in G = \mathrm{Gal}(K_y(x_1)/K(x_1))$ we have 

\[ x_1 = \sum_{j=1}^{n} \nu_j \varphi(y_j). \]

If we add up all these equations, we obtain 

\[ [K_y(x_1):K(x_1)] \cdot x_1 = \sum_{\varphi \in G} \sum_{j=1}^{n} \nu_j \varphi(y_j) = \sum_{j=1}^{n} \nu_j \sum_{\varphi \in G} \varphi(y_j) = \sum_{j=1}^{n} \nu_j \operatorname{tr} y = 0. \]

This implies 

\[ p \mid [K_y(x_1):K(x_1)] \mid [K_{x,y}:K]. \]

Theorem 3 The condition $p \mid [K_{x,y}:K]$ must be satisfied in every deficient 
system. If $m \ge n$ and $p > n$, then $p \mid m$. In particular, $p \le \max(m, n)$. 

Proof The condition $p \mid [K_{x,y}:K]$ is a consequence of Theorem 2 and of 
the “divisibility preserving” property of every reduction procedure to a 
primitive system. The second statement also holds for primitive systems. 
Namely, in the proof of Theorem 2 we found that $p \nmid mn$ implies 
$p \mid [K_y(x_1):K(x_1)]$, and Theorem 1 says that $K_y(x_1) = K_y$ if $p \nmid mn$. 
Therefore, we must have $p \mid [K_y:K] \mid n!$, which is tantamount to $p \le n$, and as a 
consequence of the divisibility preserving property of every reduction 
procedure, it holds for every deficient system. 

Theorem 4 If one of the parameters $m$ or $n$ of a deficient system is a prime 
power, then $p \mid mn$. 

Proof It suffices to consider primitive systems only. If $n = q^{\beta}$ and 
$p \nmid mn$, then we have Eq. (4) as in the proof of Theorem 2. As a next step, 
however, we shall not apply the full group $G = \mathrm{Gal}(K_y(x_1)/K(x_1))$ to it but 
merely its Sylow $q$-subgroup $G_q$, say. It is well known that $G_q$ is transitive on 
the $y$'s and therefore $\sum_{\varphi \in G_q} \varphi(y_j)$ does not depend on $j$. It follows that 

\[ |G_q| \cdot x_1 = \sum_{\varphi \in G_q} \sum_{j=1}^{n} \nu_j \varphi(y_j) = \sum_{j=1}^{n} \nu_j \sum_{\varphi \in G_q} \varphi(y_j) = 0, \]

which is a contradiction. 

Theorems 1-4 have been first proved by Isaacs [1]. The proofs given here 
are not essentially different, but they seem to be using simpler language. 
Also, they differ in the concept of a primitive system. 
Now we would like to propose the following 

Problem Describe all primitive deficient systems. 

In this connection, it seems to be reasonable to introduce the following 
definition. A primitive deficient system will be called regular if k = n < m, 
and irregular otherwise. 

Theorem 5 In a regular primitive deficient system with $k = n < m$ we 
necessarily have $m = p^a$, where $1 \le a \le n - 1$. 

Proof We have $[K(x_1 + y_1):K] = m$ and $K(x_1 + y_1, y_1) = K(x_1, y_1)$, 
and therefore the fields $K(x_1 + y_1)$ and $K(y_1)$ are linearly disjoint. If we have 
$x_1 + y_1 = x_2 + y_2$, say, we have $x_2 = x_1 + y_1 - y_2$. We can fix $y_2$ and send 
$x_1 + y_1$ to an arbitrary sum $x_i + y_j$ and then fix $x_i + y_j$ and send $y_2$ to an 
arbitrary $y_l$. This shows that every expression $x_i + y_j - y_l$ is among the 
conjugates of $x_1$. It follows that every expression 

\[ x_1 + \sum_{j=1}^{n} \lambda_j y_j \qquad \Bigl(\sum_{j=1}^{n} \lambda_j = 0,\ 0 \le \lambda_j \le p - 1\Bigr) \]

is a conjugate of $x_1$. Conversely, by (2) every $x_i$ can be written in such a way. 
If we denote by $a$ the number of linearly independent conjugates of $y_1$, then 
$m = p^a$ (elements $y_1, y_2, \ldots, y_r$ are called linearly independent if 

\[ \sum_{j=1}^{r} \lambda_j y_j \in K,\ \lambda_j \in \mathbb{Z}_p \ \Longrightarrow\ \lambda_1 = \lambda_2 = \cdots = \lambda_r = 0). \]

The primitive systems can be described by the equations (1) satisfied by 
the conjugates $x_i + y_j$. Two systems will be considered the same if the 
equations differ by a permutation of the conjugates $x_1, x_2, \ldots, x_m$ and $y_1, y_2, 
\ldots, y_n$ only. By abuse of language we shall refer to these systems of equations 
as the primitive systems, regardless of their algebraic representation. We 
shall write for brevity i j = k l instead of $x_i + y_j = x_k + y_l$. 

If $2 = n < m$, then $k = 2$, and thus $m = p$ by Theorem 5. We have one 
regular primitive system for every $p$ with equations 

    1 1 = 2 2 

    2 1 = 3 2 

      ... 

    p 1 = 1 2. 

If $3 = k = n < m$, then $m = p^2$ or $p$. For every $p$, we have one regular system 
with $m = p^2$. The equations can easily be written down. For $p = 3$ and for 
every $p \equiv 1 \pmod 3$ there is one regular system with $m = p$. Let us remark that 
for $p = 3$ we have $\operatorname{tr} y = 0$ if $m = 3$, and $\operatorname{tr} y \ne 0$ if $m = 3^2$. 

If $4 = k = n < m$, then $m = p^3$, $p^2$, or $p$. For every $p$, there is one regular 
system with $m = p^3$. If $m = p^2$, there is one regular system for $p = 2$ and 
$p \equiv -1 \pmod 4$ and two regular systems for $p \equiv 1 \pmod 4$. Namely, for every $p$ there 
is the possibility $y_1 + y_2 = y_3 + y_4 = 0$, and for $p \equiv 1 \pmod 4$ we also may have 
$y_3 = \lambda y_1 + (\lambda - 1) y_2$, where $\lambda^2 = -1$. If $p \equiv 1 \pmod 4$, then there is also one 
regular system with $m = p$. 

Of particular interest are the irregular systems. There is just one such 
system for $3 = n < m$, with $k = 2$, $m = 6$, and $p = 3$. The equations are 

    1 1 = 2 2      4 1 = 5 3      1 2 = 4 3 

    2 1 = 3 3      5 1 = 6 2      3 2 = 6 3 

    3 1 = 4 2      6 1 = 1 3      5 2 = 2 3. 

For $4 = n < m$, there is also just one irregular system, with $k = 3$, $m = 6$, and 
$p = 2$. The equations are 

    1 1 = 2 2 = 3 3      4 1 = 2 4 = 6 3 

    2 1 = 1 2 = 4 4      4 2 = 1 4 = 5 3 

    3 1 = 1 3 = 5 4      3 4 = 5 1 = 6 2 

    3 2 = 2 3 = 6 4      4 3 = 5 2 = 6 1. 

Although the proofs of the last statements are rather tedious, they are not 
difficult in principle and will be omitted here. 

From these examples it might seem that $mn/k$ is always a $p$-power. 
However, this is not the case. Namely, for $n = 6$, $m = 10$, $k = 4$, and $p = 2$ 
($mn/k = 15$) we have the irregular system 

    1 1 = 2 2 = 3 3 = 4 4      1 6 = 6 2 = 8 3 = 10 4     3 4 = 4 3 = 5 6 = 6 5 

    1 2 = 2 1 = 5 5 = 6 6      2 3 = 3 2 = 9 6 = 10 5     3 5 = 6 4 = 7 1 = 10 2 

    1 3 = 3 1 = 7 5 = 8 6      2 4 = 4 2 = 7 6 = 8 5      3 6 = 5 4 = 8 1 = 9 2 

    1 4 = 4 1 = 9 5 = 10 6     2 5 = 5 1 = 8 4 = 10 3     4 5 = 6 3 = 8 2 = 9 1 

    1 5 = 5 2 = 7 3 = 9 4      2 6 = 6 1 = 7 4 = 9 3      4 6 = 5 3 = 7 2 = 10 1. 

Although we could construct infinitely many such examples, namely 

\[ m = 2^{2t+1} + 2^{t}, \quad n = 2^{2t+1} - 2^{t}, \quad k = 2^{2t}, \quad p = 2, \quad \frac{mn}{k} = 2^{2t+2} - 1, \quad t = 1, 2, 3, \ldots, \]

we were as yet unable to construct a system with $p \nmid mn$. However, we can 
prove the following strengthening of Theorem 3. 
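The bookkeeping behind the three irregular systems listed above can be checked mechanically. The following is only a sketch: it verifies that every pair (i, j) occurs exactly once and that every equation has k terms, and it checks the arithmetic of the infinite parameter family; it does not verify that a field extension realizing the equations exists. The string encoding `i.j` for the sum x_i + y_j and the names `parse`, `is_consistent`, and `family` are illustrative assumptions, not notation from the text.

```python
def parse(equations):
    """Turn '1.1=2.2 4.1=5.3' into [[(1, 1), (2, 2)], [(4, 1), (5, 3)]].

    A term 'i.j' stands for the sum x_i + y_j; '=' joins equal sums.
    """
    return [[tuple(int(t) for t in term.split(".")) for term in eq.split("=")]
            for eq in equations.split()]


def is_consistent(system, m, n, k):
    """Every pair (i, j) occurs exactly once and every equation has k terms."""
    pairs = sorted(pair for eq in system for pair in eq)
    all_pairs = [(i, j) for i in range(1, m + 1) for j in range(1, n + 1)]
    return pairs == all_pairs and all(len(eq) == k for eq in system)


# The irregular system with n = 3, m = 6, k = 2, p = 3.
SYSTEM_6_3 = ("1.1=2.2 4.1=5.3 1.2=4.3 2.1=3.3 5.1=6.2 "
              "3.2=6.3 3.1=4.2 6.1=1.3 5.2=2.3")

# The irregular system with n = 4, m = 6, k = 3, p = 2.
SYSTEM_6_4 = ("1.1=2.2=3.3 4.1=2.4=6.3 2.1=1.2=4.4 4.2=1.4=5.3 "
              "3.1=1.3=5.4 3.4=5.1=6.2 3.2=2.3=6.4 4.3=5.2=6.1")

# The irregular system with n = 6, m = 10, k = 4, p = 2.
SYSTEM_10_6 = ("1.1=2.2=3.3=4.4 1.6=6.2=8.3=10.4 3.4=4.3=5.6=6.5 "
               "1.2=2.1=5.5=6.6 2.3=3.2=9.6=10.5 3.5=6.4=7.1=10.2 "
               "1.3=3.1=7.5=8.6 2.4=4.2=7.6=8.5 3.6=5.4=8.1=9.2 "
               "1.4=4.1=9.5=10.6 2.5=5.1=8.4=10.3 4.5=6.3=8.2=9.1 "
               "1.5=5.2=7.3=9.4 2.6=6.1=7.4=9.3 4.6=5.3=7.2=10.1")


def family(t):
    """Parameters (m, n, k) of the infinite family with p = 2."""
    return 2 ** (2 * t + 1) + 2 ** t, 2 ** (2 * t + 1) - 2 ** t, 2 ** (2 * t)


if __name__ == "__main__":
    print(is_consistent(parse(SYSTEM_10_6), 10, 6, 4))
    m, n, k = family(1)
    print(m, n, k, m * n // k)
```

For t = 1 the family reproduces the parameters m = 10, n = 6, k = 4 of the last system.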
Theorem 6 If $m \ge n$ and $p \nmid mn$ in a deficient system, then $p < \tfrac12 n$. 

Proof Let us suppose that $\tfrac12 n < p < n$. By Theorem 3, $p$ must divide the 
order of the group $\mathrm{Gal}(K_y(x)/K(x))$ and therefore the group must be primitive 
on the $y$'s. Since it contains a $p$-cycle, the group must be 
$(n - p + 1)$-transitive. If we have $x_1 + y_1 = x_2 + y_2$, say, then there exists a 
transformation that keeps $x_1$ and $y_1$ fixed and sends $y_2$ to an arbitrary $y_j$ 
$(j \ne 1)$. Therefore $k = n$ in contradiction to Theorem 5. 

Corollary If $p \nmid mn$ in a deficient system, then $\min(m, n) > 10$. 

Proof This follows from Theorems 4 and 6. 

Now, let us study the products $xy$ instead of the sums $x + y$. Some first 
information can be obtained from [2]. As a matter of fact, the following 
theorem is contained in [2] although it is not explicitly stated there. 

Theorem 7 Let $K$ be a field, $0 \notin X \subset K$, $0 \notin Y \subset K$, $|X| = m$, and 
$|Y| = n$. If there is no element $z \in K$ represented uniquely as $z = xy$, $x \in X$, 
and $y \in Y$, then there are two natural numbers $e$ and $f$ and two pairs of distinct 
elements $x_1, x_2 \in X$ and $y_1, y_2 \in Y$ such that 

\[ x_1^{e} = x_2^{e} \quad\text{and}\quad e < \min(2^{m+n-2},\ n^{m-1}) \]

and 

\[ y_1^{f} = y_2^{f} \quad\text{and}\quad f < \min(2^{m+n-2},\ m^{n-1}). \]

The definitions of multiplicative deficient and primitive deficient systems 
would be analogous. We can prove 

Theorem 8 Let $K$, $K(x)$, $K(y)$, $K(xy)$, and $K(x, y)$ form a primitive 
(multiplicative) system with $[K_{x,y}:K] = g$. Then $x^g \in K$ and $y^g \in K$. 

Proof Instead of Eqs. (2), we have 

\[ x_i = x_1 \prod_{j=1}^{n} y_j^{\lambda_{ij}}, \qquad \sum_{j=1}^{n} \lambda_{ij} = 0 \qquad (1 \le i \le m). \tag{5} \]

If we choose one such equation for every $i$ and multiply them out, we obtain 

\[ N(x) = x_1^{m} \prod_{j=1}^{n} y_j^{\lambda_j}, \qquad \sum_{j=1}^{n} \lambda_j = 0. \]

If we apply all $\varphi \in G = \mathrm{Gal}(K_y(x_1)/K(x_1))$ to it and multiply the resulting 
equations, we get 

\[ (N(x))^{g/m} = x_1^{g} \prod_{j=1}^{n} \Bigl(\prod_{\varphi \in G} \varphi(y_j)\Bigr)^{\lambda_j} = x_1^{g}, \]

since $K_y(x_1) = K_{x,y}$ by (5), $\prod_{\varphi \in G} \varphi(y_j)$ does not depend on $j$ and 
$\sum_{j=1}^{n} \lambda_j = 0$. The proof of $y^g \in K$ is analogous. 

Let us remark that Theorem 6 does not hold in an arbitrary multiplicative 
deficient system. 

As in the additive case, we can define the regular primitive multiplicative 
systems by the condition $1 < k = n < m$. Let us prove 

Theorem 9 In a regular primitive multiplicative system we have $x^m \in K$ 
and $y^m \in K$. 

Proof The conjugates of $x$ and $y$ satisfy a system of $m$ equations of the 
type 

\[ x_{i_1} y_1 = x_{i_2} y_2 = \cdots = x_{i_n} y_n. \]

If we multiply all of them combining terms containing the same $y$'s, we 
obtain 

\[ N(x)\,y_1^{m} = N(x)\,y_2^{m} = \cdots = N(x)\,y_n^{m}, \]

which implies $y_1^{m} = y_2^{m} = \cdots = y_n^{m} \in K$. Instead of Eqs. (2) we have Eqs. (5). If 
we raise them to the $m$th power, we get 

\[ x_i^{m} = x_1^{m}, \qquad 1 \le i \le m, \]

thus $x^m \in K$. 

In contrast to the additive case, we can prove 


Theorem 10 All primitive multiplicative systems are regular. 

Proof Let us assume that we have a primitive multiplicative system that 
is not regular. We can easily see that $x^m \in K$ would imply the regularity of 
the system. Let $m_1 > m$ be the minimal positive exponent such that $x^{m_1} \in K$ 
and $y^{m_1} \in K$. We may assume that we have a system with a minimal value of 
$m$ and then also with a minimal value of $m_1$. By Theorem 8, $m_1$ is not 
divisible by primes larger than $m$. Let $p \le m$ be such that $p \mid m_1$. Let us put 
$X_i = x_i^{p}$, $Y_j = y_j^{p}$ for $1 \le i \le m$ and $1 \le j \le n$. The expressions $X_i$ cannot be 
all equal since otherwise $x^p \in K$. Therefore, $K(X_1)$ is a nontrivial extension 
of $K$ of degree at most $m$. Since the equations for $x_i$ and $y_j$ are linked, the five 
fields $K$, $K(X_1)$, $K(Y_1)$, $K(X_1 Y_1)$, $K(X_1, Y_1)$ form a multiplicative deficient 
system. Finally, $X_1^{m_1/p} \in K$. Thus, the parameters of this new deficient system 
are certainly smaller than the parameters of the old system. Therefore, this 
new system must be regular. 

Conversely, let us start with this new regular system and put $x^p = X$. 
This would force the old system to be imprimitive, which is a contradiction. 
If $K$ is an algebraic number field and $\mathcal{O}_K$ the algebraic integers in $K$, let $\mathrm{Ind}(\alpha) = [\mathcal{O}_K : \mathbb{Z}[\alpha]]$ for 
$\alpha \in \mathcal{O}_K$. If $m(K) = \min_{\alpha \in \mathcal{O}_K} \mathrm{Ind}(\alpha)$, it is shown that $m(K)$ is unbounded as $K$ runs through cubic 
subfields of the cyclotomic fields of $l$th roots of unity for $l$ prime and that there exist infinitely 
many cyclic cubic fields $K$ with $m(K) = 1$. We also compute $m(K)$ for some specific examples. 

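Let $K$ be an algebraic number field and let $\mathcal{O}_K$ be its ring of integers. 
Given $\alpha \in \mathcal{O}_K$, define the “index” of $\alpha$, written $\mathrm{Ind}(\alpha)$, to be the group index 
$[\mathcal{O}_K : \mathbb{Z}[\alpha]]$. This index is finite if $\alpha$ generates $K$ over the rationals $\mathbb{Q}$, and $\alpha$ is 
said to generate a “power basis” if this index is 1, i.e., $\mathcal{O}_K = \mathbb{Z}[\alpha]$. If $\omega_1 = 1, 
\omega_2, \ldots, \omega_n$ is an integral basis for $\mathcal{O}_K$, then the index of $\alpha$ is the absolute 
value of the determinant of the matrix transforming the basis $\omega_1, \ldots, \omega_n$ to $1, 
\alpha, \alpha^2, \ldots, \alpha^{n-1}$. If $\delta(\alpha) = \prod_{\sigma \ne 1} (\alpha - \sigma(\alpha))$ (where the product is taken over 
all nontrivial isomorphisms of $K$ into an algebraic closure), then 
$\mathrm{Norm}_{K/\mathbb{Q}}(\delta(\alpha)) = D(1, \alpha, \ldots, \alpha^{n-1})$ where $D(\alpha_1, \ldots, \alpha_n)$ is the discriminant of 
$\{\alpha_1, \ldots, \alpha_n\}$. Hence 

\[ \mathrm{Norm}_{K/\mathbb{Q}}(\delta(\alpha)) = \mathrm{Ind}(\alpha)^2 \cdot D(\omega_1, \ldots, \omega_n) = \mathrm{Ind}(\alpha)^2 \cdot D(K), \]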
where $D(K)$ is the discriminant of $K$. Let 

\[ \alpha = x_1 \cdot 1 + x_2 \cdot \omega_2 + \cdots + x_n \cdot \omega_n. \]

Since $\mathrm{Ind}(\alpha) = \mathrm{Ind}(\alpha - x_1)$, we may assume $x_1 = 0$. Then 

\[ \alpha^{i} = f_{i1} \cdot 1 + \cdots + f_{in} \cdot \omega_n \]

where the $f_{ij}$ are forms of degree $i$ in $x_2, \ldots, x_n$. Hence the determinant of the 
transformation matrix is a form of degree $\tfrac12 n(n - 1)$ in the $n - 1$ variables 
$x_2, \ldots, x_n$. This form is called the “indicial form,” and we shall denote by 
$m(K)$ the minimal value assumed by the form as $\alpha$ ranges over the integers of 
$K$. $\mathcal{O}_K$ has a power basis if and only if $m(K) = \pm 1$. 

The question of the existence of a power basis was originally examined 
by Dedekind [4]. It was demonstrated that the ring of integers in the cubic 
field generated by a root of $x^3 + x^2 - 2x + 8$ has no power basis. The 
Dedekind example led to the following result of Hensel [11]: 

Proposition The index of every element $\alpha \in \mathcal{O}_K$ is divisible by the prime $p$ 
if and only if 

\[ g(f) > \frac{1}{f} \sum_{d \mid f} \mu\!\left(\frac{f}{d}\right) p^{d} \]

for some integer $f \ge 1$, where $g(f)$ denotes the number of prime ideals dividing 
$(p)$ in $K$ with residue class degree $f$ (such primes are called “common index 
divisors”). 

We shall call the case $[K : \mathbb{Q}] = 3$ of this result the Dedekind test. We shall 
be concerned with determining the minimal index for cubic subfields of 
prime cyclotomic fields. 
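A small sketch of how a criterion of Hensel's type can be evaluated, under the assumption that $g(f)$ is compared with the number of monic irreducible polynomials of degree $f$ over $\mathbb{Z}/p\mathbb{Z}$ (a standard count via the Möbius function). The function names and the encoding of a splitting type as a list of residue class degrees are illustrative choices, not notation from the text.

```python
from collections import Counter


def mobius(n):
    """Moebius function of a positive integer n, by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0  # n has a square factor
            result = -result
        p += 1
    return -result if n > 1 else result


def irreducible_count(p, f):
    """Number of monic irreducible polynomials of degree f over Z/pZ."""
    total = sum(mobius(f // d) * p ** d for d in range(1, f + 1) if f % d == 0)
    return total // f


def is_common_index_divisor(p, residue_degrees):
    """Assumed test: some degree f has more primes above p than
    there are irreducible polynomials of degree f mod p."""
    counts = Counter(residue_degrees)
    return any(g > irreducible_count(p, f) for f, g in counts.items())


if __name__ == "__main__":
    # A cubic field in which 2 splits completely has three primes of degree 1.
    print(is_common_index_divisor(2, [1, 1, 1]))
    # If 2 remains prime there is one prime of degree 3.
    print(is_common_index_divisor(2, [3]))
```

In a cubic field only the prime 2 with three degree-one factors can fail the count, since for $p \ge 3$ there are at least three linear polynomials mod $p$.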

The cyclotomic field of $l$th roots of unity $K_l$ contains a (unique) cubic 
subfield $K$ if and only if $l \equiv 1 \pmod 3$. Let $l$ be such a prime, and $\zeta$ any 
nontrivial solution of $x^l = 1$. Then $K_l = \mathbb{Q}(\zeta)$ and the Galois group $G(K_l/\mathbb{Q})$ is 
cyclic of order $l - 1$, generated by the automorphism $\tau\colon \zeta \to \zeta^{g}$, where $g$ is a 
primitive root mod $l$. The cubic subfield $K$ is then the fixed field of the 
subgroup $\langle \tau^3 \rangle$. 

If we denote by $\alpha$ the relative trace from $K_l$ to $K$ of $\zeta$, then $\alpha$ is a generator 
of the field $K$, $K = \mathbb{Q}(\alpha)$. This follows from the fact that $\alpha$ is fixed by $\langle \tau^3 \rangle$ and 
cannot be rational since fewer than $l - 1$ of the elements $1, \zeta, \ldots, \zeta^{l-1}$ are 
linearly independent. 

Proposition 1 The element $\alpha = \operatorname{Tr}_{K_l/K}(\zeta)$ forms a normal basis for $\mathcal{O}_K$, i.e., 
the elements $\alpha$, $\sigma(\alpha)$, $\sigma^2(\alpha)$ (the $K$-conjugates of $\alpha$) are an integral basis for $\mathcal{O}_K$. 

Proof It suffices to show that each $\gamma \in \mathcal{O}_K$ can be written in the form 
$\gamma = a\alpha + b\sigma(\alpha) + c\sigma^2(\alpha)$ with $a, b, c \in \mathbb{Z}$. Since $\alpha$ generates $K$, $\gamma \in \mathcal{O}_K$ implies 
$\gamma = r\alpha + s\sigma(\alpha) + t\sigma^2(\alpha)$ where $r, s, t \in \mathbb{Q}$. As also $\gamma \in \mathcal{O}_{K_l}$, considering this 
expression in $K_l$ and noting $\mathcal{O}_{K_l} = \mathbb{Z}[\zeta]$ shows $r, s, t \in \mathbb{Z}$. 

To determine the arithmetic of the field K, it is convenient to find the 
minimal polynomial satisfied by a. The solution to this problem is afforded 
by the theory of cyclotomy. 


Proposition 2 Let $l$ be a prime $\equiv 1 \pmod 3$. Then $4l$ has the (unique) 
representation $4l = c^2 + 27d^2$, with $c \equiv 1 \pmod 3$, $d > 0$. With respect to this 
representation, the minimal polynomial for $\alpha = \operatorname{Tr}_{K_l/K}(\zeta)$ is 

\[ f_\alpha(x) = x^3 + x^2 - \left(\frac{l - 1}{3}\right) x - \left(\frac{lc + 3l - 1}{27}\right). \]

Proof The number $\alpha$ is a “cyclotomic period” of degree $e = 3$ (i.e., 
$\alpha = \sum_i a_i(\zeta)$, $a_i \in \langle \tau^3 \rangle$). The period equation satisfied by $\alpha$ is [5, p. 393] 

\[ \begin{vmatrix} 1 + \alpha & 1 & 1 \\ (1, 0)\alpha & (1, 1) - \alpha & (1, 2) \\ (2, 0)\alpha & (2, 1) & (2, 2) - \alpha \end{vmatrix} = 0 \]

where the $(i, j)$ are the “cyclotomic numbers” associated with $e = 3$. These 
numbers can be computed in terms of $l$, $c$, and $d$, where $4l = c^2 + 27d^2$, 
$c \equiv 1 \pmod 3$. From Storer [14], 

\[ 18(1, 0) = 2l - 4 - c - 9d, \]

\[ 18(1, 1) = 2l - 4 - c + 9d, \]

\[ 9(1, 2) = l + c + 1, \]

\[ (2, 0) = (1, 1), \quad (2, 1) = (1, 2), \quad (2, 2) = (1, 0). \]

Solving the determinant with these values yields the polynomial $f_\alpha(x)$. 
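The period polynomial can be checked numerically for small primes. The sketch below finds the representation $4l = c^2 + 27d^2$ by brute force, builds the three cubic periods from a primitive root, and measures how far they are from being roots of the cubic with coefficients $1,\ 1,\ -(l-1)/3,\ -(lc + 3l - 1)/27$. The function names are illustrative, and floating-point evaluation only gives evidence, not a proof.

```python
import cmath
from math import isqrt


def representation(l):
    """Return (c, d) with 4*l == c*c + 27*d*d, c % 3 == 1 and d > 0."""
    for d in range(1, isqrt(4 * l // 27) + 1):
        c2 = 4 * l - 27 * d * d
        c = isqrt(c2)
        if c * c == c2:
            return (c if c % 3 == 1 else -c), d
    raise ValueError("expected a prime l congruent to 1 mod 3")


def primitive_root(l):
    """Smallest primitive root modulo the prime l (brute force)."""
    for g in range(2, l):
        if len({pow(g, k, l) for k in range(l - 1)}) == l - 1:
            return g
    raise ValueError("no primitive root found")


def cubic_periods(l):
    """The three periods: sums of zeta^(g^(3k+i)) for i = 0, 1, 2."""
    g = primitive_root(l)
    zeta = cmath.exp(2j * cmath.pi / l)
    return [sum(zeta ** pow(g, 3 * k + i, l) for k in range((l - 1) // 3)).real
            for i in range(3)]


def period_polynomial(l):
    """Integer coefficients (1, 1, b, c0) of x^3 + x^2 + b*x + c0."""
    c, _ = representation(l)
    return 1, 1, -((l - 1) // 3), -((l * c + 3 * l - 1) // 27)


def max_residual(l):
    """Largest |f(eta)| over the three numerically computed periods."""
    _, _, b, c0 = period_polynomial(l)
    return max(abs(eta ** 3 + eta ** 2 + b * eta + c0) for eta in cubic_periods(l))


if __name__ == "__main__":
    for l in (7, 13, 19, 31):
        print(l, representation(l), period_polynomial(l), max_residual(l))
```

For $l = 7$ this gives the familiar cubic $x^3 + x^2 - 2x - 1$ satisfied by $2\cos(2\pi/7)$.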

The polynomial $f_\alpha(x)$ determines the splitting of all primes in $K$. The 
prime $l$ ramifies. Since discriminant $f_\alpha(x) = l^2 d^2$ (an elementary calculation), 
the splitting of all primes $p \nmid d$ is determined by the splitting of $f_\alpha(x)$ in $\mathbb{Z}/p\mathbb{Z}$ 
[12, pp. 27-29]. If now $p \mid d$, then $l^2 d^2 \equiv 0 \pmod p$, hence $f_\alpha(x)$ splits 
completely mod $p$. Thus $\alpha, \sigma(\alpha), \sigma^2(\alpha) \in \mathbb{Z}/p\mathbb{Z}$. But since $\alpha$, $\sigma(\alpha)$, $\sigma^2(\alpha)$ generate 
$\mathcal{O}_K$ over $\mathbb{Z}$, $\alpha$, $\sigma(\alpha)$, $\sigma^2(\alpha)$ generate $\mathcal{O}_K/\mathfrak{p}$ over $\mathbb{Z}/p\mathbb{Z}$ ($\mathfrak{p}$ a prime lying above $p$). 
Hence $f(\mathfrak{p}/(p)) = 1$, and $p$ splits (completely) in $K$. Thus the splitting of all 
primes is determined by $f_\alpha$. 

The theory of cyclotomy also allows ready computation of the indicial 
form. Using the integral basis $1$, $\alpha$, $\sigma(\alpha)$, and using the relations among the 
periods in Dickson [5, p. 393], we have 

\[ \beta = x\alpha + y\sigma(\alpha), \]

\[ \beta^2 = f_{21} \cdot 1 + f_{22} \cdot \alpha + f_{23} \cdot \sigma(\alpha) = A\alpha + B\sigma(\alpha) + C\sigma^2(\alpha), \]

where 

\[ A = (0, 0)x^2 + (0, 2)y^2 + 2(0, 1)xy - (p - 1), \]

\[ B = (0, 1)x^2 + (0, 0)y^2 + 2(0, 2)xy - (p - 1), \]

\[ C = (0, 2)x^2 + (0, 1)y^2 + 2(0, 0)xy - (p - 1). \]

Thus 

\[ \mathrm{Ind}(\beta) = \pm \begin{vmatrix} 1 & 0 & 0 \\ 0 & x & y \\ -C & A - C & B - C \end{vmatrix}. \]

Computing the determinant with 

\[ (0, 0) = \frac{l - 8 + c}{9}, \quad (0, 1) = (1, 0), \quad (0, 2) = (1, 1) \]

we obtain the indicial form. We summarize the results in the following 


Proposition 3 Let $\beta \in \mathcal{O}_K$. Then the index of $\beta$ is given by 

\[ \pm \mathrm{Ind}(\beta) = dx^3 + \left(\frac{-c - 3d}{2}\right) x^2 y + \left(\frac{c - 3d}{2}\right) x y^2 + dy^3 \]

where $4l = c^2 + 27d^2$, $c \equiv 1 \pmod 3$, $d > 0$, and $\beta = x\alpha + y\sigma(\alpha)$. 

This representation for the indicial form gives immediately an upper 
bound for $m(K)$. For $x = 1$, $y = 0$, we have $\mathrm{Ind}(\alpha) = d$. Hence $m(K) \le 
(4l/27)^{1/2}$. 

The question of the existence of a power basis is reduced to considering 
whether this form represents 1 (note that if the form represents $\pm 1$ then it 
represents 1). Direct numerical tabulation of values represented by this form 
allows us to determine power bases in some cases, for example $l = 241$ 
($x = -1$, $y = 2$), 373 ($x = -2$, $y = 3$), 379 ($x = -1$, $y = 3$), 463 ($x = 
-1$, $y = 2$), 751 ($x = -1$, $y = 3$). 
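The tabulated power bases can be spot-checked by evaluating the indicial form directly. In this sketch the form is taken as $dx^3 + \tfrac{-c-3d}{2}x^2y + \tfrac{c-3d}{2}xy^2 + dy^3$ with $c \equiv 1 \pmod 3$; replacing $c$ by $-c$ (equivalently, choosing the other generator of the Galois group) interchanges the roles of $x$ and $y$, so the check accepts the listed pair in either order. That tolerance, and the function names, are assumptions of the sketch rather than statements from the text.

```python
from math import isqrt


def representation(l):
    """Return (c, d) with 4*l == c*c + 27*d*d, c % 3 == 1 and d > 0."""
    for d in range(1, isqrt(4 * l // 27) + 1):
        c2 = 4 * l - 27 * d * d
        c = isqrt(c2)
        if c * c == c2:
            return (c if c % 3 == 1 else -c), d
    raise ValueError("expected a prime l congruent to 1 mod 3")


def indicial_form(l, x, y):
    """Value of the indicial form at (x, y); c and d have equal parity,
    so both middle coefficients are integers."""
    c, d = representation(l)
    return (d * x ** 3 + (-c - 3 * d) // 2 * x * x * y
            + (c - 3 * d) // 2 * x * y * y + d * y ** 3)


def gives_power_basis(l, x, y):
    """True if the form takes the value +1 or -1 at (x, y) or at (y, x)."""
    return abs(indicial_form(l, x, y)) == 1 or abs(indicial_form(l, y, x)) == 1


# Pairs (x, y) as listed in the text.
EXAMPLES = {241: (-1, 2), 373: (-2, 3), 379: (-1, 3), 463: (-1, 2), 751: (-1, 3)}

if __name__ == "__main__":
    for l, (x, y) in EXAMPLES.items():
        print(l, representation(l), gives_power_basis(l, x, y))
```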


Proposition 4 2 splits (completely) in the extension $K$ if and only if 2 
divides $d$. 

Proof Since $\alpha$, $\sigma(\alpha)$, $\sigma^2(\alpha)$ are a $\mathbb{Z}$-basis for $\mathcal{O}_K$, their images under the 
map $\mathcal{O}_K \to \mathcal{O}_K/\mathfrak{p}$ generate $\mathcal{O}_K/\mathfrak{p}$ over $\mathbb{Z}/p\mathbb{Z}$. Suppose 2 divides $d$. Then 
discriminant$(f_\alpha(x)) \equiv 0 \pmod 2$, so $f_\alpha(x)$ has a multiple root mod 2, hence 
splits completely mod 2 (a cubic having a multiple root has all its roots in 
the ground field). Thus $\alpha, \sigma(\alpha), \sigma^2(\alpha) \in \mathbb{Z}/2\mathbb{Z}$, i.e., $[\mathcal{O}_K/\mathfrak{p}_2 : \mathbb{Z}/2\mathbb{Z}] = 1$. Hence 
$f(\mathfrak{p}_2/(2)) = 1$, and 2 splits in $K$. Suppose now that $d$ is not divisible by 2 and 
2 splits in $K$. Then $f_\alpha(x)$ must split completely into distinct linear factors 
mod 2, clearly impossible. 
Theorem 1 (Dedekind) If $d \equiv 0 \pmod 2$, then no power basis exists, 
and in fact every index is divisible by 2. 

Proof This follows immediately from the Dedekind test and the above 
proposition. We may also see this directly from the indicial form as follows: 
$4l = c^2 + 27d^2$ implies $c$ and $d$ have the same residue class mod 2. Hence if $d$ 
is even, $c$ is also. Writing the indicial form as 

\[ d(x^3 + y^3) - \frac{c}{2}\,(x^2 y - x y^2) - \frac{3d}{2}\,(x^2 y + x y^2) \]

and noting $x^2 y + x y^2 \equiv x^2 y - x y^2 \equiv 0 \pmod 2$ for all values of $x$ and $y$, we 
see that the form is always even. 

Remark By Hensel’s result above, 2 is a common index divisor in cubic 
fields if and only if 2 splits completely in the extension. By a result of 
Engstrom [6], $2^1$ is the only possible common index divisor in cubic fields, 
e.g., 4 never occurs as a common index divisor. References [2,4,6,9,15] yield 
a fairly complete history of the results on common index divisors. 

Lemma 1 Let $p$ be a prime $\equiv 1 \pmod 3$ and let $\rho$ be a primitive third 
root of unity mod $p$. Let $a$ be any nonzero residue mod $p$. Suppose $a^{-1} d$ is a 
cubic nonresidue mod $p$ and that $c \equiv (6\rho^2 + 3)d \bmod p$. Then the indicial 
form does not represent $a$. 

Proof For $c \equiv (6\rho^2 + 3)d \bmod p$, the indicial form becomes 

\[ f(x, y) = dx^3 + 3\rho d\,x^2 y + 3\rho^2 d\,x y^2 + dy^3 \]

since $1 + \rho = -\rho^2 \bmod p$. Hence, 

\[ f(x, y) \equiv d(x + \rho y)^3 \bmod p \]

and the values assumed by the indicial form lie in the same cubic residue 
class as $d$. Since $a$ and $d$ lie in distinct cubic classes by assumption, $a$ is not 
represented by the form. 
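The congruence used in the proof can be confirmed by brute force for a small modulus. The sketch below assumes the indicial form $dx^3 + \tfrac{-c-3d}{2}x^2y + \tfrac{c-3d}{2}xy^2 + dy^3$ reduced mod $p$ (with division by 2 carried out via the inverse of 2 mod $p$) and compares it with $d(x + \rho y)^3$ at every residue pair. The function name is illustrative; the three-argument `pow` with exponent $-1$ needs Python 3.8 or later.

```python
def is_cube_of_linear_form(p, rho, d):
    """Check f(x, y) == d * (x + rho*y)**3 (mod p) for all x, y,
    where c = (6*rho^2 + 3) * d mod p and rho is a primitive cube root of 1 mod p."""
    if rho % p == 1 or pow(rho, 3, p) != 1:
        raise ValueError("rho must be a primitive cube root of unity mod p")
    inv2 = pow(2, -1, p)                      # inverse of 2 modulo p
    c = (6 * rho * rho + 3) * d % p
    coeff_x2y = (-c - 3 * d) * inv2 % p       # coefficient of x^2 y
    coeff_xy2 = (c - 3 * d) * inv2 % p        # coefficient of x y^2
    for x in range(p):
        for y in range(p):
            form = d * x ** 3 + coeff_x2y * x * x * y + coeff_xy2 * x * y * y + d * y ** 3
            if (form - d * (x + rho * y) ** 3) % p != 0:
                return False
    return True


if __name__ == "__main__":
    # 2 is a primitive cube root of unity mod 7, and 3 is one mod 13.
    print(is_cube_of_linear_form(7, 2, 1))
    print(is_cube_of_linear_form(13, 3, 5))
```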

Using Lemma 1 we are able to construct an infinite class of fields that do 
not satisfy the conditions necessary for the Dedekind test but which 
nevertheless do not have a power basis. This set of fields is constructed in such a 
way that the minimal index $m(K)$ is unbounded as $K$ ranges over the set. 
That the minimal index is unbounded over the set of pure cubic fields is a 
result of Hall [9]; the following shows that the same is true for the set of 
cyclic cubic fields. We shall need the following special case of the generalized 
Dirichlet theorem. For the proof as a consequence of the Tchebotarev 
density theorem, see Bass et al. [1]. 
Theorem Let $F$ be a number field. Given nonzero $a, b \in \mathcal{O}_F$ and a nonzero 
ideal $\mathfrak{A}$ such that $a\mathcal{O}_F + b\mathcal{O}_F = \mathcal{O}_F = \mathfrak{A} + b\mathcal{O}_F$, there exist infinitely many 
primes $\mathfrak{p}$ such that $\mathfrak{p}\mathfrak{A} = m\mathcal{O}_F$ for some $m \equiv a \pmod{b\mathcal{O}_F}$. 

Theorem 2 Given any $N > 0$, there exists a cubic subfield $K$ of a prime 
cyclotomic field $K_l = \mathbb{Q}(\zeta)$, $\zeta$ a primitive $l$th root of unity, such that $m(K) > N$ 
and 2 remains prime in $\mathcal{O}_K$. 

The condition that 2 remain prime in $\mathcal{O}_K$, by Proposition 4, implies that 
the Dedekind test for the nonexistence of a power basis does not apply, i.e., 
there is no common index divisor. 

Proof Let $p_1, \ldots, p_N$ be distinct primes $\equiv 1 \pmod 3$. Let $\rho_i$ be a 
primitive cube root of 1 mod $p_i$. Choose $d_i$ so that $i^{-1} d_i$ is a cubic nonresidue 
mod $p_i$, and let $c_i = (6\rho_i^2 + 3)d_i$. By the Chinese remainder theorem, let $c$ 
and $d$ be solutions to the following system of congruences: 

\[ c \equiv d \equiv 1 \pmod 2, \qquad c \equiv c_i \pmod{p_i}, \]
\[ c \equiv 1 \pmod 3, \qquad d \equiv d_i \pmod{p_i}. \]

Let now $F = \mathbb{Q}(\sqrt{-3})$ in the Dirichlet theorem above, and set 

\[ a = \frac{c + 3d\sqrt{-3}}{2}, \qquad b = 6 \prod_{i=1}^{N} p_i, \qquad \mathfrak{A} = \mathcal{O}_F. \]

Since 


i= 1 

xt C 2 + 21d 2 2 /(6pf + 3) 2 + 3\ 2 

Norm f/Q (a) =---= d?l--^-1 mod p ; = -6 df mod p f 

and di # 0 (mod p f ), it follows that (a) is relatively prime to (f]fL, p ; ). Also 

Norm F/c (a) = 1 (mod 6) 
so (a) is also prime to (6). Thus 

(a) + (b) = 91 + (b) = 0 F , 

and by the Dirichlet theorem, there exists a prime p such that p = ( m ), where 
_c + 3d^J — 3 


m = 


mod^nP,)- 


If the prime p were of degree 2, then p = (p) = (m), where p is a rational 
prime. Hence p ~ m in 0 F . The units of F are ±1, ±co , and ±co 2 , where 
co = + x/—3). Hence we would have 


m 


^+3dyjsj +6 np.(- + y~^ ) 
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with x, y e Z, and such that 

m = ± p, ± ± pco 2 . 

The first is impossible since 

3 d + 6yY\Pi = 1 ( mod 2 )> 

i = 1 

so m cannot be real. The latter two cases would imply 

±P ± Psf^?> 

m ~ 2 

which is also impossible since 

+ |c + 6x n Pij = ± ^3 d + 6y n Pi) 

is impossible mod 3. 

Thus the prime p is of first degree, so 

/ = Norm F/Q (p) 

(c + 6 n?= i Pi * x) + (3d + 6 []?= i Pi * y) r~r\ _ 52 + 27f2 
2 V / 4 

is a rational prime with 

s = t = 1 (mod 2) 

s = c f (mod p,), t = di (mod p,). 

Let K be the cubic subfield of the prime cyclotomic field /C* for this /. The 
numbers 1,..., N do not occur as indices by Lemma 1, and 2 remains prime 
in K by Proposition 4. 
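
The two norm congruences in this proof follow from short modular computations. The sketch below is an illustration only (the helper names are not from the text). It assumes the reading $c \equiv (6\rho_i + 3)d_i$ and $3d \equiv d_i \pmod{p_i}$, and checks that $(c^2 + 27d^2)/4 \equiv -6d_i^2 \pmod{p_i}$, as well as that $(c^2 + 27d^2)/4 \equiv 1 \pmod 6$ whenever $c$ and $d$ are odd and $c \equiv 1 \pmod 3$.

```python
def primitive_cube_root(p):
    """Smallest rho with rho^2 + rho + 1 = 0 (mod p); exists when p = 1 (mod 3)."""
    for r in range(2, p):
        if (r * r + r + 1) % p == 0:
            return r
    raise ValueError("p must be a prime congruent to 1 mod 3")


def norm_mod_p(p, d_i):
    """(c^2 + 27 d^2)/4 mod p when c = (6 rho + 3) d_i and 3d = d_i (mod p)."""
    rho = primitive_cube_root(p)
    c_i = (6 * rho + 3) * d_i
    four_norm = (c_i * c_i + 3 * d_i * d_i) % p  # 27 d^2 = 3 (3d)^2
    return four_norm * pow(4, -1, p) % p


def norm_is_one_mod_six(c, d):
    """True when (c^2 + 27 d^2)/4 is an integer congruent to 1 mod 6."""
    n, r = divmod(c * c + 27 * d * d, 4)
    return r == 0 and n % 6 == 1


print(norm_mod_p(7, 1), (-6) % 7)
print(norm_is_one_mod_six(7, 5))
```

The first check rests on the identity $(6\rho + 3)^2 = 9(2\rho + 1)^2 \equiv -27 \pmod p$.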

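The indicial form and the integers it represents can also be connected
with a certain norm form in the cubic field $K$. Letting $\beta = x\alpha + y\sigma(\alpha)$, we
know

$$D(1, \beta, \beta^2) = \operatorname{Ind}(\beta)^2 \cdot D(K),$$

or since $D(K) = l^2$,

$$\operatorname{Ind}(\beta)^2 \cdot l^2 = N(\delta(\beta)).$$

The prime $l$ is totally ramified in $K$, and the prime sitting above $l$ in $K$ is
principal, say

$$l = \mathfrak{p}^3 = (\gamma)^3$$

($\gamma$ can be taken to be the relative norm from $K_l$ to $K$ of $1 - \zeta$). Since $l$ is
totally ramified, we can also write

$$l^2 = \operatorname{Norm}_{K/\mathbb{Q}}(\gamma)^2 = \operatorname{Norm}(\gamma\sigma^2(\gamma)).$$

Letting $\delta(\beta) = (\beta - \sigma(\beta))(\beta - \sigma^2(\beta))$, we have

$$\operatorname{Ind}(\beta)^2 = \frac{\operatorname{Norm}\bigl((\beta - \sigma(\beta))(\beta - \sigma^2(\beta))\bigr)}{\operatorname{Norm}(\gamma\sigma^2(\gamma))} = \left|\operatorname{Norm}\left(\frac{\beta - \sigma(\beta)}{\gamma}\right)\right| \cdot \left|\operatorname{Norm}\left(\frac{\beta - \sigma^2(\beta)}{\sigma^2(\gamma)}\right)\right|.$$

Applying $\sigma$ to the second norm, we find that

$$\left|\operatorname{Norm}\left(\frac{\beta - \sigma^2(\beta)}{\sigma^2(\gamma)}\right)\right| = \left|\operatorname{Norm}\left(\frac{\sigma(\beta) - \beta}{\gamma}\right)\right|$$

so that

$$\operatorname{Ind}(\beta) = \left|\operatorname{Norm}_{K/\mathbb{Q}}\left(\frac{\beta - \sigma(\beta)}{\gamma}\right)\right|.$$

Note that as $l$ is totally ramified, the inertial group of $\mathfrak{p} = (\gamma)$ is the full
Galois group, so $\beta \equiv \sigma(\beta) \bmod \mathfrak{p}$ for all $\sigma$. Hence the element $(\beta - \sigma(\beta))/\gamma$ is
an integer. We write this as

Proposition 5 $\operatorname{Ind}(\beta) = |\operatorname{Norm}_{K/\mathbb{Q}}(\xi)|$ where $\xi = (\beta - \sigma(\beta))/\gamma \in O_K$.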

By this proposition, the existence of certain indices implies certain
relations in the class group of $K$. In particular, in the case of prime indices, we
have the following result.

Proposition 6 Suppose the prime $p \neq l$ is an index in $K$. Then $p$ splits in $K$
into three principal factors.

Proof By Proposition 5, $p = \operatorname{Norm}(\xi)$ for some $\xi \in O_K$. Since $p$ is
unramified in $K$, $p = \mathfrak{p}_1\mathfrak{p}_2\mathfrak{p}_3$, with $f(\mathfrak{p}_i/(p)) = 1$. Further, $(\xi) = \mathfrak{p}_i$ for some
$i$, is principal, and since the $\mathfrak{p}_i$ are conjugate under the Galois group, each $\mathfrak{p}_i$
is principal.

If the structure of the class group is known, Proposition 6 is useful in
eliminating certain possible values for $m(K)$. For example, if $l = 277$, then
$m(K) = 2$ or 4. Determining the splitting of all primes $p < l/7$ to determine
the class group structure (the constant 7 is due to Davenport [3]), using
$f_\alpha(x) = x^3 + x^2 - 92x + 236$ (see the comment following Proposition 2),
shows that only 2, 13, 19, and 37 split in $K$. Since $\operatorname{Norm}(\alpha - 4) = 2^2 \cdot 13$,
$\operatorname{Norm}(\alpha - 8) = 2^2 \cdot 19$, and $\operatorname{Norm}(\alpha - 5) = 2 \cdot 37$, we have

$$C_K = \langle [\mathfrak{p}_2], [\mathfrak{p}_2'] \rangle$$

where $C_K$ is the class group of $K$ and $[\mathfrak{p}_2]$, $[\mathfrak{p}_2']$ are the classes of two
(conjugate) primes lying above 2 in $K$. Since the class number of $K$ can be
computed by analytic techniques, and $h(K) = 4$ for this field [8], $\mathfrak{p}_2$ cannot
be principal. Hence by Proposition 6, 2 is not an index, so $m(K) = 4$. Similar
calculations hold for the prime $l = 397$, but for all other undecided $m(K)$,
$l < 1000$, $h(K) = 1$, so Propositions 5 and 6 are of no help.
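
The arithmetic behind the example $l = 277$ is easy to reproduce. The sketch below makes two assumptions that are not spelled out in the text: that the bound on the primes is $p < l/7$, and that splitting is tested with the standard criterion that a prime $p \neq l$ splits completely in the cubic subfield of $\mathbb{Q}(\zeta_l)$ exactly when $p$ is a cubic residue mod $l$. The norms are taken up to sign, using $N(\alpha - a) = -f_\alpha(a)$.

```python
L_PRIME = 277


def f_alpha(x):
    """Minimal polynomial of alpha for l = 277, as quoted in the text."""
    return x**3 + x**2 - 92 * x + 236


def norm_alpha_minus(a):
    """N(alpha - a) = -f_alpha(a) for a monic cubic with root alpha."""
    return -f_alpha(a)


def is_prime(n):
    return n > 1 and all(n % q for q in range(2, int(n**0.5) + 1))


def split_primes(l):
    """Primes p with 7p < l that are cubic residues mod l (hence split in K)."""
    return [p for p in range(2, l)
            if 7 * p < l and is_prime(p) and pow(p, (l - 1) // 3, l) == 1]


print([abs(norm_alpha_minus(a)) for a in (4, 8, 5)])
print(split_primes(L_PRIME))
```

The cubic-residue test is used instead of counting roots of $f_\alpha$ modulo $p$ because $f_\alpha$ has a repeated root modulo 2, the prime 2 dividing the index of $\alpha$.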

We remark that the converse of Proposition 6 does not always hold, for
example in the case $l = 499$, where $h(K) = 1$, and 2 splits into necessarily
principal factors, and yet 2 is not an index (congruence considerations of the
indicial form mod 9).

Let now $n$ be a square-free integer, not necessarily prime. If
$n \equiv 1 \pmod{3}$, then the cyclotomic field $K_n$ has a cubic subfield $K$ of
discriminant $n^2$. By Hasse [10], the element $\alpha = (-1)^{r+1}\operatorname{Tr}_{K_n/K}(\zeta)$ generates a
normal basis for $O_K$, where $r$ is the number of distinct primes dividing $n$ and
$\zeta$ is a primitive $n$th root of unity. Further, $\alpha$ has minimal polynomial

$$f_\alpha(x) = x^3 + x^2 - \left(\frac{n - 1}{3}\right)x - \left(\frac{(c + 3)n - 1}{27}\right)$$

where $4n = c^2 + 27d^2$, $c \equiv 1 \pmod{3}$ as in the case for $n$ prime.

Theorem 3 There exist infinitely many cubic subfields $K$ of cyclotomic
fields $K_n$ for which $m(K) = 1$, i.e., a power basis exists for the ring of integers in
$K$.

Proof The index of $\alpha$ defined above is given by $(D(f_\alpha(x))/D(K))^{1/2}$, and a
straightforward computation of $D(f_\alpha(x))$ shows $\alpha$ has index $d$. Hence it
suffices to show there exist infinitely many square-free integers $n$ such that
$4n = c^2 + 27$, or equivalently, $n = k^2 + k + 7$ (see Narkiewicz [13]). Let $N_x$
denote the number of square-free integers of the form $k^2 + k + 7$ for $k \le x$.
Then $N_x \ge x - M_x$, where $M_x$ denotes the number of integers of the form
$k^2 + k + 7$ for $k \le x$ and having a square factor. Hence,

$$M_x = \operatorname{card}\{k \le x \mid k^2 + k + 7 \equiv 0 \pmod{p^2} \text{ for some prime } p\}.$$

We may clearly take $p \le x$. Since the congruence $k^2 + k + 7 \equiv 0 \pmod{p^2}$
has at most two solutions on any residue system except for $p = 3$, we have

$$M_x \le \frac{x}{3} + \sum_{5 \le p \le x}\left(\frac{2x}{p^2} + \theta_p\right)$$

where $\theta_p \le 2$. Hence

$$N_x \ge x\Bigl(\tfrac{2}{3} - 2\sum_{p \ge 5} p^{-2}\Bigr) - 2\sum_{p \le x} 1 = Cx + o(x).$$

Since $C = \tfrac{2}{3} - 2\sum_{p \ge 5} p^{-2} > 0$, we have $N_x \to \infty$ as $x \to \infty$, concluding the
proof.
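
The counting argument in the proof of Theorem 3 can be illustrated empirically. The following sketch (function names are illustrative) counts the square-free values of $k^2 + k + 7$ for $1 \le k \le x$ by trial division and compares the proportion with the lower bound $C \approx 0.48$, taking $C = 2/3 - 2\sum_{p \ge 5} p^{-2}$ as read from the proof. It also checks the identity $4(k^2 + k + 7) = (2k + 1)^2 + 27$ that links $n$ to $c = 2k + 1$, $d = 1$.

```python
def is_squarefree(n):
    """Trial division test: no prime square divides n."""
    p = 2
    while p * p <= n:
        if n % (p * p) == 0:
            return False
        p += 1
    return True


def count_squarefree_values(x):
    """Number of k in 1..x for which k^2 + k + 7 is square-free."""
    return sum(1 for k in range(1, x + 1) if is_squarefree(k * k + k + 7))


x = 1000
count = count_squarefree_values(x)
print(count, count / x)
```

Every $k \equiv 1 \pmod 3$ gives a value divisible by 9, which accounts for the term $x/3$ in the bound for $M_x$.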
The determination of $m(K)$ for any particular case reduces to solving a
finite number of cubic Diophantine equations, namely $f(x, y) = a$, where
$f(x, y)$ is the indicial form and $a$ is a positive integer less than $(4l/27)^{1/2}$. In
conclusion, we determine $m(K)$ for the cubic subfield $K$ of $K_{181}$. We let $N(A)$
denote $\operatorname{Norm}_{K/\mathbb{Q}}(A)$.

For $l = 181$, we have $c = 7$, $d = 5$, and the indicial form for this field is
$f(z, y) = 5z^3 - 11z^2y - 4zy^2 + 5y^3$.

The prime 3 remains prime in this extension, so by Proposition 6, 3 cannot be
an index. Further, for $d$ odd, $m(K)$ is also odd. (Proof: The indicial form
mod 2 for $d$ odd is either $x^3 + x^2y + y^3$ or $x^3 + xy^2 + y^3$ since one of
$-c - 3d$ and $c - 3d$ is $\equiv 0 \pmod{4}$ and the other is $\equiv 2 \pmod{4}$. Hence, an
even index is represented only for $x \equiv y \equiv 0 \pmod{2}$. But then
$(x\alpha + y\sigma(\alpha))/2$ is an integer with smaller index, a contradiction. Hence $m(K)$
is odd.) Thus $m(K) = 1$ or 5. We shall show $m(K) = 5$ by showing $f(z, y) = 1$
for integers $z$ and $y$ is impossible.
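
A small search illustrates these claims for $l = 181$. The sketch below evaluates the indicial form on a box of integer pairs, checks the parity statement, and reports the smallest absolute value found. It is only a finite experiment, not a proof that 1 is never represented, and the box size is an arbitrary choice.

```python
def f(z, y):
    """Indicial form for l = 181 (c = 7, d = 5)."""
    return 5 * z**3 - 11 * z**2 * y - 4 * z * y**2 + 5 * y**3


def even_only_at_even_pairs(bound):
    """True if f(z, y) is even only when z and y are both even."""
    return all((f(z, y) % 2 == 0) == (z % 2 == 0 and y % 2 == 0)
               for z in range(-bound, bound + 1)
               for y in range(-bound, bound + 1))


def smallest_value(bound):
    """Smallest |f(z, y)| over nonzero pairs in the box [-bound, bound]^2."""
    return min(abs(f(z, y))
               for z in range(-bound, bound + 1)
               for y in range(-bound, bound + 1)
               if (z, y) != (0, 0))


print(even_only_at_even_pairs(6), smallest_value(15))
```

Since $f(1, 0) = 5$, the value 5 is attained, in agreement with $\operatorname{Ind}(\alpha) = d = 5$.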

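Assume $f(z, y) = 1$, for $z, y \in \mathbb{Z}$. Then letting $x = 5z$, we have

$$25 = 25 \cdot f(z, y) = x^3 - 11x^2y - 20xy^2 + 125y^3.$$

By Proposition 5, $f(x, y)$ is always reducible in $K[x, y]$, and in this case we
have

$$(25) = (x - \theta y)(x - \sigma(\theta)y)(x - \sigma^2(\theta)y) \qquad (1)$$

where $\theta = \alpha + 4$. Define $\mathfrak{p}_5$ by

$$(\alpha - \sigma(\alpha)) = \mathfrak{p}_5\mathfrak{p}_{181}$$

(recall $\operatorname{Ind}(\alpha) = d = 5$). Since $N(\theta) = -125$, $(\theta)$ is divisible by three primes
above 5. Also $N(\theta + 5) = -5^2 \cdot 7$. If $(\theta)$ were divisible by the three distinct
primes above 5 in $K$, then $(5) \mid (\theta + 5)$ ($\mid$ denotes divisibility), which would
imply $5^3 \mid N(\theta + 5)$, a contradiction. If $(\theta)$ were the cube of a prime above 5,
then $N(\theta + 5)$ would not be divisible by $5^2$. Hence $(\theta) = \mathfrak{q}_5^2\mathfrak{q}_5'$, where $\mathfrak{q}_5$, $\mathfrak{q}_5'$
are two distinct primes lying above 5 in $K$. Since $(\theta - \sigma(\theta)) = (\alpha - \sigma(\alpha))$, we
have by definition of $\mathfrak{p}_5$, $\mathfrak{p}_5 \mid (\theta - \sigma(\theta))$, and considering the possibilities for
$\mathfrak{q}_5$ and $\mathfrak{q}_5'$, we see that only $(\theta) = \mathfrak{p}_5^2\mathfrak{p}_5''$ and $(\theta) = (\mathfrak{p}_5'')^2\mathfrak{p}_5$ are possible, where
$\mathfrak{p}_5' = \sigma(\mathfrak{p}_5)$ and $\mathfrak{p}_5'' = \sigma^2(\mathfrak{p}_5)$. In either case, since $5 \mid x$, we have

$$\mathfrak{p}_5\mathfrak{p}_5'' \mid (x - y\theta)$$

and since $N(x - y\theta) = 25$,

$$(x - y\theta) = \mathfrak{p}_5\mathfrak{p}_5''. \qquad (2)$$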

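An integral basis for this field is given by $\alpha$, $\sigma(\alpha)$, and $\sigma^2(\alpha)$, or in terms of
the computationally easier $1$, $\alpha$, $\alpha^2$,

$$\sigma(\alpha) = \frac{38 - 2\alpha - \alpha^2}{5}, \qquad \sigma^2(\alpha) = \frac{-43 - 3\alpha + \alpha^2}{5}.$$

There are two conjugate fundamental units for this field, given by

$$\varepsilon_1 = \frac{26 + 21\alpha - 2\alpha^2}{5}, \qquad \varepsilon_2 = \frac{54 - 16\alpha - 3\alpha^2}{5}$$

(these units were originally calculated by the method of Godwin [7], and are
the same units given in the tables of Gras [8]).

We may calculate $\mathfrak{p}_5$ explicitly from $(\alpha - \sigma(\alpha)) = \mathfrak{p}_5\mathfrak{p}_{181}$ to be

$$\mathfrak{p}_5 = \left(\frac{2 - 8\alpha + \alpha^2}{5}\right)$$

with conjugates

$$\left(\frac{7 + 7\alpha + \alpha^2}{5}\right), \qquad \left(\frac{126 + \alpha - 2\alpha^2}{5}\right).$$

Hence, from Eq. (2), we have

$$x - y\theta = \varepsilon_1^{n}\varepsilon_2^{m}\left(\tfrac{1}{5}\right)^2(2 - 8\alpha + \alpha^2)(126 + \alpha - 2\alpha^2)$$

$$x - y\sigma(\theta) = \varepsilon_1^{-m}\varepsilon_2^{n-m}\left(\tfrac{1}{5}\right)^2(7 + 7\alpha + \alpha^2)(2 - 8\alpha + \alpha^2)$$

$$x - y\sigma^2(\theta) = \varepsilon_1^{m-n}\varepsilon_2^{-n}\left(\tfrac{1}{5}\right)^2(126 + \alpha - 2\alpha^2)(7 + 7\alpha + \alpha^2) \qquad (3)$$

where we have used the fact that $\varepsilon_1$ and $\varepsilon_2$ are conjugates to obtain the
second two equations, and where $m$ and $n$ are integers.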

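We shall show the equations in (3) are impossible by considering them
first modulo $\mathfrak{p}_7$ (a prime lying above 7 in $K$) and then modulo $\mathfrak{p}_{19}$ (similarly
a prime lying above 19 in $K$). Note that $N(\alpha - 2) = 5^2 \cdot 7$ and $N(\alpha - 7) =
5 \cdot 19$, so that both $\mathfrak{p}_7$ and $\mathfrak{p}_{19}$ are first degree primes in $K$, which may be
defined as the kernels of the homomorphisms $O_K \to \mathbb{Z}/7\mathbb{Z}$ (by $\alpha \to 2$) and
$O_K \to \mathbb{Z}/19\mathbb{Z}$ (by $\alpha \to 7$), respectively.

Modulo $\mathfrak{p}_7$, we have, since $\alpha \equiv 2 \pmod{\mathfrak{p}_7}$,

$$\varepsilon_1 \equiv 5 \pmod{\mathfrak{p}_7}, \qquad \varepsilon_2 \equiv 2 \pmod{\mathfrak{p}_7}$$

$$\frac{2 - 8\alpha + \alpha^2}{5} \equiv 5 \pmod{\mathfrak{p}_7}$$

$$\frac{7 + 7\alpha + \alpha^2}{5} \equiv 5 \pmod{\mathfrak{p}_7}$$

$$\frac{126 + \alpha - 2\alpha^2}{5} \equiv 3 \pmod{\mathfrak{p}_7}$$

$$\theta = \alpha + 4 \equiv 6 \pmod{\mathfrak{p}_7}, \qquad \sigma(\theta) \equiv 3 \pmod{\mathfrak{p}_7},$$

and

$$\sigma^2(\theta) \equiv 2 \pmod{\mathfrak{p}_7}.$$

Hence Eq. (3) becomes

$$x + y \equiv 5^{n}2^{m} \equiv (-1)^{n}2^{n+m} \pmod{\mathfrak{p}_7}$$
$$x - 3y \equiv 5^{-m}2^{n-m} \cdot 4 \equiv (-1)^{m}2^{n-2m} \cdot 4 \pmod{\mathfrak{p}_7}$$
$$x - 2y \equiv 5^{m-n}2^{-n} \equiv (-1)^{n+m}2^{m-2n} \pmod{\mathfrak{p}_7} \qquad (4)$$

Then

$$\frac{x - 2y}{x + y} \equiv (-1)^{m} \pmod{\mathfrak{p}_7}, \qquad \frac{x - 2y}{x - 3y} \equiv (-1)^{n} \cdot 2 \pmod{\mathfrak{p}_7}$$

imply that $m$ is odd and $n$ is even ($m$ even implies $y \equiv 0 \pmod{7}$, and the
indicial form does not represent 1 for such $y$, and $m$ odd implies
$x \equiv 4y \pmod{\mathfrak{p}_7}$, which shows $(x - 2y)/(x - 3y) \equiv 2 \pmod{\mathfrak{p}_7}$, i.e., $n$ is
even).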
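
The residues used in this reduction can be recomputed directly. The sketch below assumes the expressions $\sigma(\alpha) = (38 - 2\alpha - \alpha^2)/5$ and $\sigma^2(\alpha) = (-43 - 3\alpha + \alpha^2)/5$ for the conjugates of $\alpha$, together with the units and generators of $\mathfrak{p}_5$ as read from the text, and evaluates each of them at $\alpha \equiv a \pmod p$. The dictionary keys are labels chosen for this illustration.

```python
def residues(p, a):
    """Images mod p of the units, the generators of p_5 and the conjugates of theta
    under the homomorphism alpha -> a."""
    inv5 = pow(5, -1, p)

    def val(c0, c1, c2):
        # (c0 + c1*alpha + c2*alpha^2) / 5 evaluated at alpha = a
        return (c0 + c1 * a + c2 * a * a) * inv5 % p

    return {
        "eps1": val(26, 21, -2),
        "eps2": val(54, -16, -3),
        "pi": val(2, -8, 1),
        "pi_conj1": val(7, 7, 1),
        "pi_conj2": val(126, 1, -2),
        "theta": (a + 4) % p,
        "sigma_theta": (val(38, -2, -1) + 4) % p,
        "sigma2_theta": (val(-43, -3, 1) + 4) % p,
    }


print(residues(7, 2))
print(residues(19, 7))
```

The same helper applies to the reduction modulo $\mathfrak{p}_{19}$, where $\alpha \equiv 7$.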

We now consider Eqs. (3) mod $\mathfrak{p}_{19}$, where $\alpha \equiv 7 \pmod{\mathfrak{p}_{19}}$. We have
here

$$\varepsilon_1 \equiv 15 \pmod{\mathfrak{p}_{19}}, \qquad \varepsilon_2 \equiv 16 \pmod{\mathfrak{p}_{19}}$$

$$\frac{2 - 8\alpha + \alpha^2}{5} \equiv 18 \pmod{\mathfrak{p}_{19}}$$

$$\frac{7 + 7\alpha + \alpha^2}{5} \equiv 2 \pmod{\mathfrak{p}_{19}}$$

$$\frac{126 + \alpha - 2\alpha^2}{5} \equiv 7 \pmod{\mathfrak{p}_{19}}$$

$$\theta \equiv 11 \pmod{\mathfrak{p}_{19}}, \qquad \sigma(\theta) \equiv -1 \pmod{\mathfrak{p}_{19}}, \qquad \sigma^2(\theta) \equiv 1 \pmod{\mathfrak{p}_{19}}.$$

Here we have

$$x - 11y \equiv 15^{n}16^{m} \cdot 12 \pmod{\mathfrak{p}_{19}}$$
$$x + y \equiv 15^{-m}16^{n-m} \cdot 17 \pmod{\mathfrak{p}_{19}}$$
$$x - y \equiv 15^{m-n}16^{-n} \cdot 14 \pmod{\mathfrak{p}_{19}}.$$

Since $n$ is even, $m$ odd, set $n = 2k_1$ and $m = 2k_2 + 1$. Then the first two
equations become

$$x - 11y \equiv (15^2)^{k_1}(16)^{2k_2} \cdot 2 \pmod{\mathfrak{p}_{19}}$$
$$x + y \equiv (15^2)^{-k_2}(16)^{2k_1 - 2k_2} \cdot 3 \pmod{\mathfrak{p}_{19}}.$$

Now, 2 is a quadratic nonresidue mod $\mathfrak{p}_{19}$, as is 3 (the quadratic residues
are 1, 4, 5, 6, 7, 9, 11, 16, 17), so both $x - 11y$ and $x + y$ are quadratic
nonresidues. Tabulating the pairs $(z, y)$ for which $f(z, y) \equiv 1 \pmod{19}$, we
find that for no pairs are $5z - 11y$ and $5z + y$ both nonresidues. Hence the
equations (3) are impossible, and $m(K) = 5$.
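
The final tabulation modulo 19 is small enough to carry out by machine. The sketch below is an illustration, not the authors' original computation. It lists the residue pairs $(z, y)$ with $f(z, y) \equiv 1 \pmod{19}$ for which both $5z - 11y$ and $5z + y$ are quadratic nonresidues, using Euler's criterion.

```python
P = 19


def f(z, y):
    """Indicial form for l = 181."""
    return 5 * z**3 - 11 * z**2 * y - 4 * z * y**2 + 5 * y**3


def is_nonresidue(a):
    """Euler's criterion: a is a quadratic nonresidue mod P."""
    return pow(a % P, (P - 1) // 2, P) == P - 1


def offending_pairs():
    """Pairs with f = 1 (mod 19) and both linear factors nonresidues."""
    return [(z, y) for z in range(P) for y in range(P)
            if f(z, y) % P == 1
            and is_nonresidue(5 * z - 11 * y)
            and is_nonresidue(5 * z + y)]


print(offending_pairs())
```

An empty list is what the argument requires. The pair $(4, 2)$, for instance, satisfies $f \equiv 1 \pmod{19}$ but has $5z - 11y \equiv 17$, a residue.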

Table 1 lists the indicial form coefficients (as $dx^3 + Ax^2y + Bxy^2 + dy^3$),
and the known $m(K)$ for $l < 1000$. In the cases where several indices are
listed, they are the only possible values for $m(K)$ but it is undetermined
which is correct. Except for the case $l = 181$ above, these values were
obtained by Proposition 6 (using the tables of Gras [8] for the class numbers)
and by congruence considerations of the indicial form. Note that $c$ has been
normalized positive in the table and that this does not affect the values
assumed by the indicial form.

Finally, we wish to acknowledge the assistance of George Cooke for
helpful suggestions and comments.

It has been shown by the authors that any two integral quadratic forms
in the same genus over the rational number field which are not properly
spinor equivalent do not become properly spinor equivalent in any
odd-degree extension of $\mathbb{Q}$ (see Earnest and Hsia [7]). But examples given in
Earnest and Hsia [4, 6] have indicated that this type of behavior cannot, in
general, be expected for quadratic extensions. It is the purpose of this paper
to give constructive methods for determining the amount of collapsing of
proper spinor genera in quadratic extensions of $\mathbb{Q}$. Additionally, several
results are obtained giving information about the number of proper spinor
genera in the genus of the lifted lattice.

The vehicle used in studying the lifting behavior of spinor genera is the
correspondence, first described by Kneser in [11], between the proper spinor
genera and a factor group of the idele group of the coefficient field. The
noncollapsing of the proper spinor genera in odd-degree extensions can be
established by showing the injectivity of the natural lifting map between the
corresponding idele groups. In the case of quadratic extensions, however, the
kernel of this map may be nontrivial and it is this kernel that we shall
compute in Section 1. Of course, in order to determine the idele groups
involved, it is first necessary to compute the local spinor norm groups of
the lattice under consideration. This can be carried out for the spots on
the rational number field using techniques from Kneser [11] and Earnest
and Hsia [5]. However, these spinor norms have not previously been
determined for the dyadic spots on quadratic fields. While the general
problem of calculating the spinor norm groups over arbitrary dyadic local fields
is extremely difficult, it is possible to use the strong local condition that
the ramification degree is less than or equal to 2 to extend the results of Hsia
[9] and Earnest and Hsia [5] to perform the necessary calculations for
quadratic fields. This is done in Section 2.

The third section of this paper shows several applications of the
techniques of the first two sections. Examples of $\mathbb{Z}$-lattices and quadratic
extensions are constructed showing that the number of spinor genera which
collapse may be any arbitrary 2-power. We give a condition sufficient to
assure that, while distinct proper spinor genera from the genus of a $\mathbb{Z}$-lattice
$L$ may collapse in the genus of the lattice $\tilde{L}$ lifted to a quadratic extension,
the number $g^+(\tilde{L})$ of proper spinor genera in $\operatorname{gen}(\tilde{L})$ is no smaller than the
number $g^+(L)$ in $\operatorname{gen}(L)$. The condition is satisfied, in particular, when there
is no rational prime that divides both the discriminant of the field extension
and the determinant of the lattice. Finally, the computations appearing in
Section 2 are applied to give sufficient conditions, in terms of the reduced
determinant of $L$, for the number $g^+(\tilde{L})$ to be smaller than or equal to the
order of the 2-Sylow subgroup of the ideal class group of the quadratic field.
Of course, if the class number of the field is odd, this gives $g^+(\tilde{L}) = 1$.

The techniques described in Section 1 are easily adaptable to the use of 
computer methods for the calculation of the kernel of the lifting maps. A 
sample of data obtained with the use of the IBM System 370 at the Ohio 
State University is contained in the first author’s doctoral dissertation [2]. 
Further results will appear in a future paper, written jointly with John H. 
Yang. 

We now recall briefly the notation established in our previous papers
dealing with spinor genus behavior. Unexplained notations can be found in
O'Meara [12] and Earnest and Hsia [6]. With the exception of Section 2, $L$
will denote a $\mathbb{Z}$-lattice and $\tilde{L} = L \otimes_{\mathbb{Z}} \mathfrak{o}$ the lattice lifted to the quadratic
extension $E = \mathbb{Q}(\sqrt{m})$ (here $\mathfrak{o}$ denotes the ring of algebraic integers of $E$).
The proper spinor genera in the genus of $L$ correspond bijectively to the
elements of the factor group $J_V/P_V J_V' J_L$, where $V$ denotes the quadratic
space on which $L$ is defined. The spinor norm map induces an injective map

$$J_V/P_V J_V' J_L \longrightarrow J_{\mathbb{Q}}/P_D J_{\mathbb{Q}}^{L} = \mathfrak{G}_{\mathbb{Q}}(L)$$

which is in fact an isomorphism when $\dim L \ge 3$. The corresponding groups
can be considered over the extension $E$ and for any prime spot $\mathfrak{p}$ on $E$ lying
over $p$, the lifting maps $g: J_{\mathbb{Q}} \to J_E$ and $h: J_V \to J_{\tilde{V}}$ defined respectively by

$$(g(j))_{\mathfrak{p}} = j_p \quad \text{and} \quad (h(\sigma))_{\mathfrak{p}} = \sigma_p \otimes E_{\mathfrak{p}}$$

induce homomorphisms $\psi_L$ and $\Psi_L$ in the commutative diagram

$$\begin{array}{ccc}
J_{\tilde{V}}/P_{\tilde{V}} J_{\tilde{V}}' J_{\tilde{L}} & \xrightarrow{\ \Phi_{\tilde{L}}\ } & \mathfrak{G}_E(\tilde{L}) \\
\uparrow \Psi_L & & \uparrow \psi_L \\
J_V/P_V J_V' J_L & \xrightarrow{\ \Phi_L\ } & \mathfrak{G}_{\mathbb{Q}}(L)
\end{array}$$

Upon determining $\operatorname{Ker}(\psi_L)$, it is possible to relate directly back to the
collapsing behavior of spinor genera by using the method outlined in Appendix
B of [7].


1. Computation of $\operatorname{Ker}(\psi_L)$

The problem of computing $\operatorname{Ker}(\psi_L)$ is twofold. First, we need to identify
a set of elements of $J_{\mathbb{Q}}$ whose classes modulo $P_D J_{\mathbb{Q}}^{L}$ give a complete list of the
elements of $\mathfrak{G}_{\mathbb{Q}}(L)$ (since $\dim L \ge 3$, this set is finite). Second, we need an
effective way to determine for a particular $j \in J_{\mathbb{Q}}$ whether $\psi_L(\bar{j}) =
\overline{(1)} \in \mathfrak{G}_E(\tilde{L})$ or, equivalently, whether $h(j) \in P_{\tilde{D}} J_E^{\tilde{L}}$ ($h$ denotes the map of $J_{\mathbb{Q}}$
into $J_E$ that induces $\psi_L$). These two questions are closely related and will be
treated with similar techniques. To determine whether $\bar{j}$ equals $\overline{(1)}$ in $\mathfrak{G}_{\mathbb{Q}}(L)$,
we need to check whether there is some element $\alpha$ of $D$ for which $\alpha j \in J_{\mathbb{Q}}^{L}$.
Since there are an infinite number of possible choices for such $\alpha$, the problem
is essentially that of finding some finite subset of elements of $D$ that represent
all possible cosets of the elements of $D$ modulo $J_{\mathbb{Q}}^{L}$. The second problem can
be likewise reduced to finding a finite subset of elements of $\tilde{D}$ that represent
all possible cosets of elements of $\tilde{D}$ modulo $J_E^{\tilde{L}}$.

We deal first with the question of determining representatives for the
elements of $\mathfrak{G}_{\mathbb{Q}}(L)$. There are two subsets of the set $S$ consisting of all finite
rational primes; these are related to the lattice $L$ and are important for this
investigation. Denote the set of "exceptional" primes for $L$ by

$$\mathcal{E}(L) = \{p \in S : \theta(O^+(L_p)) \neq u_p\dot{\mathbb{Q}}_p^2\}.$$

We distinguish also a subset of $\mathcal{E}(L)$ that consists of those primes that will be
used as "multipliers"; let

$$\mathcal{M}(L) = \{p \in S : \theta(O^+(L_p)) \not\subseteq u_p\dot{\mathbb{Q}}_p^2\}.$$

Note that $\mathcal{M}(L) \subseteq \mathcal{E}(L)$ and that both sets consist of only finitely many
primes. The significance of these sets lies in the following proposition.


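Proposition 1.1 (i) Let $L$ be definite and $j \in J_{\mathbb{Q}}^{S}$. Then $\bar{j} = \overline{(1)} \in \mathfrak{G}_{\mathbb{Q}}(L)$ if
and only if there is some $m = \prod_{p \in \mathcal{M}(L)} p^{\varepsilon_p}$, with $\varepsilon_p \in \{0, 1\}$ for each $p$, for which

$$m j_p \in \theta(O^+(L_p)) \quad \text{for all } p \in \mathcal{E}(L).$$

(ii) Let $L$ be indefinite and $j \in J_{\mathbb{Q}}^{S}$. Then $\bar{j} = \overline{(1)} \in \mathfrak{G}_{\mathbb{Q}}(L)$ if and only if
there is some $m = \pm\prod_{p \in \mathcal{M}(L)} p^{\varepsilon_p}$, with $\varepsilon_p \in \{0, 1\}$ for each $p$, for which

$$m j_p \in \theta(O^+(L_p)) \quad \text{for all } p \in \mathcal{E}(L).$$

Proof of (i) ($\Leftarrow$) For $p \in \mathcal{E}(L)$, we have $m j_p \in \theta(O^+(L_p))$ by
assumption. For all $p$ discrete, $p \notin \mathcal{E}(L)$, we have $m \in u_p$ and $j_p \in u_p$ by choice of $j$
and by the form of $m$. Thus, for such $p$, $m j_p \in u_p \subseteq u_p\dot{\mathbb{Q}}_p^2 = \theta(O^+(L_p))$ by the
definition of $\mathcal{E}(L)$. So, $mj \in J_{\mathbb{Q}}^{L}$ and $j \in P_D J_{\mathbb{Q}}^{L}$.

($\Rightarrow$) Since $\bar{j} = \overline{(1)}$, there is some $\alpha \in D$ for which $\alpha j \in J_{\mathbb{Q}}^{L}$. Write
$\alpha = \prod_{p \in S} p^{v_p}$, $v_p \in \mathbb{Z}$. For any $p \notin \mathcal{M}(L)$, $v_p$ must be even since $j_p \in u_p$ and
$\theta(O^+(L_p)) \subseteq u_p\dot{\mathbb{Q}}_p^2$. So $\alpha$ can be rewritten $\alpha = \prod_{p \in \mathcal{M}(L)} p^{\varepsilon_p}\alpha_0^2$ where
$\alpha_0 \in \mathbb{Q}$ and

$$\varepsilon_p = \begin{cases} 0 & \text{if } v_p \text{ even} \\ 1 & \text{if } v_p \text{ odd.} \end{cases}$$

If $\alpha' = \prod_{p \in \mathcal{M}(L)} p^{\varepsilon_p}$, then $\alpha' j \in J_{\mathbb{Q}}^{L}$ as desired.

The proof of (ii) is identical except that in the indefinite case $P_D = P_{\mathbb{Q}}$
and so $-1$ must be included as a possible factor. QED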


Consider an arbitrary nontrivial element of $\mathfrak{G}_{\mathbb{Q}}(L)$, say $\bar{j}$ for some $j \in J_{\mathbb{Q}}$.
The representative $j$ may be changed via multiplication by an element of $P_D$
without changing the class $\bar{j}$. So $\bar{k} = \bar{j}$ where $k \in J_{\mathbb{Q}}^{S}$ is defined by
$k = \prod_{p \in S} p^{-\operatorname{ord}_p(j_p)} j$. Thus, a representative of each element of $\mathfrak{G}_{\mathbb{Q}}(L)$ can be
chosen from $J_{\mathbb{Q}}^{S}$. Furthermore, for $j, k \in J_{\mathbb{Q}}^{S}$, if $j_p \in k_p\dot{\mathbb{Q}}_p^2$ for all $p \in \mathcal{E}(L)$,
then $\bar{j} = \bar{k}$. Define the set $\mathcal{E}'(L)$ by $\mathcal{E}'(L) = \{p \in \mathcal{E}(L) \mid \theta(O^+(L_p)) \neq \dot{\mathbb{Q}}_p\}$. So it
suffices to consider only the square classes of the coordinates at the primes $p$
in $\mathcal{E}'(L)$. Thus, a set of representatives of the nontrivial elements of $\mathfrak{G}_{\mathbb{Q}}(L)$ is
contained in the set of elements $j \in J_{\mathbb{Q}}^{S}$ satisfying

$$j_p = \begin{cases} 1 & \text{if } p \notin \mathcal{E}'(L) \\ \lambda_p & \text{if } p \in \mathcal{E}'(L) \end{cases} \qquad (*)$$

where, for $p$ odd, $\lambda_p$ is either 1 or $\Delta_p$ (the fixed nonsquare unit of $\mathbb{Q}_p$), and $\lambda_2$
is one of the elements 1, 3, 5, or 7. Moreover, to determine whether two such
ideles represent distinct classes, it is sufficient to check whether there exists
some $m = \pm\prod_{p \in \mathcal{M}(L)} p^{\varepsilon_p}$, with $\varepsilon_p \in \{0, 1\}$ and $-1$ is used only when $L$ is
indefinite, for which $j_p \in m k_p \theta(O^+(L_p))$ for all $p \in \mathcal{E}'(L)$.

The procedure for determining a list of representatives of the classes of
$\mathfrak{G}_{\mathbb{Q}}(L)$ can now be outlined. After determining the set $\mathcal{E}'(L)$, we consider the
list $h_1, \ldots, h_r$ of elements of $J_{\mathbb{Q}}^{S}$ described by (*). We wish to eliminate
redundancies from this list to obtain exactly one representative for each
distinct class modulo $P_D J_{\mathbb{Q}}^{L}$. Let $m_1 = 1, m_2, \ldots, m_s$ be a list of the integers of
the form $\pm\prod_{p \in \mathcal{M}(L)} p^{\varepsilon_p}$, where $\varepsilon_p \in \{0, 1\}$ and $-1$ is used only when $L$ is
indefinite, which are pairwise distinct modulo $J_{\mathbb{Q}}^{L}$. Then $m_2h_1, \ldots, m_sh_1$ are
equivalent to distinct elements of the list $h_2, \ldots, h_r$. These elements can be
removed from the list because $\overline{m_ih_1} = \overline{h_1}$. Let $u$ be the smallest index $j$ for
which $h_j$ has not been removed from the list; eliminate all the multiples
$m_2h_u, \ldots, m_sh_u$. Eliminating successively in this manner, a list of elements
distinct modulo $P_D J_{\mathbb{Q}}^{L}$ is obtained in a finite number of steps.

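Example 1.2 Consider the lattice $L = \langle 1, a^2, 7a^4 \rangle$, where

$$a = 3 \cdot 5 \cdot 19.$$

Local computations show that $\theta(O^+(L_p)) = u_p\dot{\mathbb{Q}}_p^2$ for $p \neq 2, 3, 7$, or 19,
$\theta(O^+(L_3)) = \dot{\mathbb{Q}}_3^2$, $\theta(O^+(L_7)) = \dot{\mathbb{Q}}_7$, $\theta(O^+(L_{19})) = \dot{\mathbb{Q}}_{19}^2$, and

$$\theta(O^+(L_2)) = \dot{\mathbb{Q}}_2.$$

Thus, $\mathcal{M}(L) = \{2, 7\}$, $\mathcal{E}(L) = \{2, 3, 7, 19\}$, and $\mathcal{E}'(L) = \{3, 19\}$. A list of
possible representatives of nontrivial classes of $\mathfrak{G}_{\mathbb{Q}}(L)$ is formed by $j$, $k$, and $jk$
where $j$ and $k$ are defined by

$$j_p = \begin{cases} \Delta_p & \text{if } p = 3 \\ 1 & \text{if } p \neq 3; \end{cases} \qquad k_p = \begin{cases} \Delta_p & \text{if } p = 19 \\ 1 & \text{if } p \neq 19. \end{cases}$$

The list of integers generating principal ideles which are possibly distinct
modulo $J_{\mathbb{Q}}^{L}$ is 1, 2, 7, and 14. Checking coordinatewise, we see that $(2) \equiv
jk \pmod{J_{\mathbb{Q}}^{L}}$ and $(7) \equiv (1) \pmod{J_{\mathbb{Q}}^{L}}$. So $\overline{jk} = \overline{(1)}$ and $j \equiv (2)k \pmod{J_{\mathbb{Q}}^{L}}$ gives
$\bar{j} = \bar{k}$. Hence, $|\mathfrak{G}_{\mathbb{Q}}(L)| = 2$ and $j$ is a representative of the nontrivial class.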
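
The elimination procedure outlined before Example 1.2 can be sketched for the data of that example. The model below is a simplification introduced for illustration. It records each candidate idele only by its square classes at the primes 3 and 19 (0 for a square unit, 1 for a nonsquare unit), which suffices here because the local spinor norm groups at 3 and 19 are the squares and those at 2 and 7 are everything. The multipliers 1, 2, 7, 14 act by shifting these square classes.

```python
def square_class(n, p):
    """0 if the unit n is a square modulo the odd prime p, else 1."""
    return 0 if pow(n % p, (p - 1) // 2, p) == 1 else 1


PRIMES = (3, 19)              # the set E'(L) of Example 1.2
MULTIPLIERS = (1, 2, 7, 14)   # products of the primes in M(L) = {2, 7}


def eliminate(candidates, multipliers, primes=PRIMES):
    """Keep one representative from each orbit under the multipliers."""
    shifts = [tuple(square_class(m, p) for p in primes) for m in multipliers]
    remaining = list(candidates)
    representatives = []
    while remaining:
        h = remaining.pop(0)
        representatives.append(h)
        orbit = {tuple((a + b) % 2 for a, b in zip(h, s)) for s in shifts}
        remaining = [c for c in remaining if c not in orbit]
    return representatives


# (0, 0) is the trivial class; (1, 0), (0, 1), (1, 1) model j, k and jk.
candidates = [(0, 0), (1, 0), (0, 1), (1, 1)]
print(eliminate(candidates, MULTIPLIERS))
```

Two representatives survive, the trivial class and $j$, matching $|\mathfrak{G}_{\mathbb{Q}}(L)| = 2$.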

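We now turn to the question of determining for a particular nontrivial
element $\bar{j}$ of $\mathfrak{G}_{\mathbb{Q}}(L)$ whether or not $\psi_L(\bar{j})$ is a nontrivial element of $\mathfrak{G}_E(\tilde{L})$. It is
helpful to introduce subsets of the set $T$ of all discrete prime spots of $E$,
which correspond to the sets $\mathcal{E}(L)$ and $\mathcal{M}(L)$ in $\mathbb{Q}$. Let

$$\mathcal{E}(\tilde{L}) = \{\mathfrak{p} \in T : \theta(O^+(\tilde{L}_{\mathfrak{p}})) \neq u_{\mathfrak{p}}\dot{E}_{\mathfrak{p}}^2\}$$

and

$$\mathcal{M}(\tilde{L}) = \{\mathfrak{p} \in T : \theta(O^+(\tilde{L}_{\mathfrak{p}})) \not\subseteq u_{\mathfrak{p}}\dot{E}_{\mathfrak{p}}^2\}.$$

Remark 1.3 The set $\mathcal{M}(\tilde{L})$ can be determined from $\mathcal{M}(L)$. Let $p$ be a
rational prime and $\mathfrak{p}$ a prime spot on $E$ with $\mathfrak{p} \mid p$. We make several
observations:

(i) If $p \nmid \operatorname{disc}(E/\mathbb{Q})$, then $\theta(O^+(L_p)) \subseteq u_p\dot{\mathbb{Q}}_p^2$ implies $\theta(O^+(\tilde{L}_{\mathfrak{p}})) \subseteq
u_{\mathfrak{p}}\dot{E}_{\mathfrak{p}}^2$. Hence, $\mathfrak{p} \in \mathcal{M}(\tilde{L})$ implies $p \in \mathcal{M}(L)$.

(ii) If $p$ is odd and $p \mid \operatorname{disc}(E/\mathbb{Q})$, then $\theta(O^+(\tilde{L}_{\mathfrak{p}})) \subseteq u_{\mathfrak{p}}\dot{E}_{\mathfrak{p}}^2$; hence
$\mathfrak{p} \notin \mathcal{M}(\tilde{L})$.

(iii) When $p = 2$, $p \mid \operatorname{disc}(E/\mathbb{Q})$ implies $\theta(O^+(\tilde{L}_{\mathfrak{p}})) \subseteq u_{\mathfrak{p}}\dot{E}_{\mathfrak{p}}^2$ unless
$E = \mathbb{Q}(\sqrt{m})$ where $m \equiv 3 \pmod{4}$ and $L_2$ is split by a sublattice of the type
$2^r\langle\varepsilon, 2\delta\rangle$, $r \in \mathbb{Z}$, $\varepsilon, \delta \in u_2$. This will be verified in Section 2.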

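We shall now describe the square classes of those elements of $\tilde{D}$ that have
even order at all the discrete spots on $E$. If $E$ is a real quadratic field, let $\varepsilon$
denote a fundamental unit of $E$; if $E$ is imaginary let $\varepsilon$ denote the element $-1$
except when $E = \mathbb{Q}(\sqrt{-1})$ or $\mathbb{Q}(\sqrt{-3})$, in which case $\varepsilon = i$ or $\varepsilon = -\omega_3$,
respectively, where $\omega_3$ denotes a cube root of unity.

Proposition 1.4 Let $\alpha \in \dot{E}$ with $\operatorname{ord}_{\mathfrak{p}}(\alpha)$ even for all $\mathfrak{p} \in T$. Let $p_1, \ldots, p_t$
be all the rational primes dividing $\operatorname{disc}(E/\mathbb{Q})$. Then

$$\alpha \in \pm\varepsilon^{\eta_0}\prod_{i=1}^{t} p_i^{\eta_i}\dot{E}^2$$

for some $\eta_i \in \{0, 1\}$, $i = 0, 1, \ldots, t$.

Proof Consider the ideal $\alpha\mathfrak{o}_E = \mathfrak{A}$. Since $\operatorname{ord}_{\mathfrak{p}}(\alpha)$ is even for all $\mathfrak{p} \in T$,
there is some ideal $\mathfrak{B}$ for which $\mathfrak{A} = \mathfrak{B}^2$. If $\mathfrak{B}$ is principal, say $\mathfrak{B} = \beta\mathfrak{o}_E$, then
$\alpha = \beta^2\gamma$, $\gamma \in u_E$. Thus, $\alpha \in \pm\varepsilon^{\eta_0}\dot{E}^2$ for $\eta_0 = 0$ or 1. If $\mathfrak{B}$ is not principal,
then $\mathfrak{B}$ lies in a class of order two in the ideal class group of $E$. Every ideal class
of order two can be represented by an ideal of the form $\prod_{i=1}^{t}\mathfrak{p}_i^{\eta_i}$, $\eta_i \in \{0, 1\}$,
where $\mathfrak{p}_1, \ldots, \mathfrak{p}_t$ are the ramified primes of $E$ (for a discussion of this
well-known result see, e.g., [10]). It follows that

$$\mathfrak{B} = \Bigl(\prod_{i=1}^{t}\mathfrak{p}_i^{\eta_i}\Bigr)\tau\mathfrak{o} \quad \text{and} \quad \mathfrak{A} = \mathfrak{B}^2 = \Bigl(\prod_{i=1}^{t} p_i^{\eta_i}\Bigr)\tau^2\mathfrak{o}$$

where $\mathfrak{p}_i \mid p_i$.

So $\alpha = \gamma\tau^2(\prod_{i=1}^{t} p_i^{\eta_i})$ with $\gamma \in u_E$; that is, $\alpha \in (\pm\varepsilon^{\eta_0})(\prod_{i=1}^{t} p_i^{\eta_i})\dot{E}^2$ for $\eta_0 = 0$
or 1. QED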

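Corollary 1.5 Let $L$ be a $\mathbb{Z}$-lattice with $\dim L \ge 3$ and let $\alpha \in E$ for which
$\operatorname{ord}_{\mathfrak{p}}(\alpha)$ is even for all $\mathfrak{p} \in T$.

(i) If $L$ is indefinite, if $m < 0$, or if $m > 0$ and $N_{E/\mathbb{Q}}(\varepsilon) = +1$, then
$\alpha \in \tilde{D}$ if and only if $\alpha \in \pm\varepsilon^{\eta_0}(\prod_{i=1}^{t} p_i^{\eta_i})\dot{E}^2$, $\eta_i \in \{0, 1\}$.

(ii) If $L$ is definite, $m > 0$ and $N_{E/\mathbb{Q}}(\varepsilon) = -1$, then
$\alpha \in \tilde{D}$ if and only if $\alpha \in \pm(\prod_{i=1}^{t} p_i^{\eta_i})\dot{E}^2$, $\eta_i \in \{0, 1\}$.

Returning now to the computation of $\operatorname{Ker}(\psi_L)$, we take a nontrivial
element $\bar{j}$ of $\mathfrak{G}_{\mathbb{Q}}(L)$. As in previous arguments, we may choose the
representative $j$ from $J_{\mathbb{Q}}^{S}$. If $\alpha \in \tilde{D}$ is such that $\alpha h(j) \in J_E^{\tilde{L}}$, we must have $\operatorname{ord}_{\mathfrak{p}}(\alpha)$ even for
all $\mathfrak{p} \notin \mathcal{M}(\tilde{L})$, but the order may be arbitrary at those $\mathfrak{p}$ in $\mathcal{M}(\tilde{L})$. So in
addition to the elements $\alpha$ described in Corollary 1.5, we must now also
consider those $\alpha$ possibly having odd order at some subset of the spots in
$\mathcal{M}(\tilde{L})$.

For a set $\mathfrak{Q}_1, \ldots, \mathfrak{Q}_s$ of spots in $\mathcal{M}(\tilde{L})$ we wish to describe those elements
$\alpha$ of $E$ satisfying

$$\begin{array}{ll}
\operatorname{ord}_{\mathfrak{Q}_i}(\alpha) \text{ odd} & \text{for } i = 1, \ldots, s \\
\operatorname{ord}_{\mathfrak{p}}(\alpha) \text{ even} & \text{for } \mathfrak{p} \neq \mathfrak{Q}_i,\ i = 1, \ldots, s.
\end{array} \qquad (**)$$

If $\alpha$ and $\beta$ are two elements both satisfying the conditions (**), then $\operatorname{ord}_{\mathfrak{p}}(\alpha\beta)$
is even for all $\mathfrak{p} \in T$. So the square classes of $\alpha$ and $\beta$ differ only by a product
of those elements obtained in Proposition 1.4. That is, in order to describe
the square classes of elements satisfying (**) it suffices to obtain only one
such element. The question of the existence of such an element can be settled
by an evaluation of Hilbert symbols.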


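Proposition 1.6 Let $\mathfrak{Q}_1, \ldots, \mathfrak{Q}_s$ be spots in $\mathcal{M}(\tilde{L})$ with $\mathfrak{Q}_1, \ldots, \mathfrak{Q}_{s'}$, $s' \le s$, not
inert in the extension $E/\mathbb{Q}$. The following statements are equivalent:

(i) There exists an element $\alpha \in E$ satisfying (**).

(ii) $\pm\prod_{i=1}^{s'} q_i \in N_{E/\mathbb{Q}}(E)$ where $q_i\mathbb{Z} = \mathfrak{Q}_i \cap \mathbb{Q}$.

(iii) $\bigl(\pm\prod_{i=1}^{s'} q_i, m\bigr)_p = +1$ for all $p \in \Omega_{\mathbb{Q}}$.

(iv) $\bigl(\pm\prod_{i=1}^{s'} q_i, m\bigr)_p = +1$ for $p \mid \operatorname{disc}(E/\mathbb{Q})$ and $p = 2, q_1, \ldots, q_{s'}$.

Proof (i) $\Rightarrow$ (ii) Let $\alpha$ satisfy (**). Using the formula

$$\operatorname{ord}_p(N_{E/\mathbb{Q}}(\alpha)) = \sum_{\mathfrak{p} \mid p} f(\mathfrak{p} \mid p)\operatorname{ord}_{\mathfrak{p}}(\alpha),$$

it follows that $\operatorname{ord}_p(N_{E/\mathbb{Q}}(\alpha))$ is even for those $p$ for which $p \neq q_i$ for any $i = 1,
\ldots, s'$. So consider $p = q_i$ for some $i = 1, \ldots, s'$. If $p$ ramifies in $E$, then
$f(\mathfrak{p} \mid p) = 1$ and $\operatorname{ord}_p(N_{E/\mathbb{Q}}(\alpha))$ is odd. If $p$ splits in $E$, let $\mathfrak{p}$, $\bar{\mathfrak{p}}$ denote the prime
spots lying over $p$. First, if $\mathfrak{p} = \mathfrak{Q}_i$, $\bar{\mathfrak{p}} = \mathfrak{Q}_j$ for some $i, j = 1, \ldots, s'$ with $i \neq j$,
we get $\operatorname{ord}_p(N_{E/\mathbb{Q}}(\alpha)) = \operatorname{ord}_{\mathfrak{Q}_i}(\alpha) + \operatorname{ord}_{\mathfrak{Q}_j}(\alpha)$ is even since both $\operatorname{ord}_{\mathfrak{Q}_i}(\alpha)$ and
$\operatorname{ord}_{\mathfrak{Q}_j}(\alpha)$ are odd. On the other hand, if $\bar{\mathfrak{p}} \neq \mathfrak{Q}_j$ for any $j = 1, \ldots, s'$, then
$\operatorname{ord}_p(N_{E/\mathbb{Q}}(\alpha))$ is odd since $\operatorname{ord}_{\mathfrak{p}}(\alpha)$ is odd and $\operatorname{ord}_{\bar{\mathfrak{p}}}(\alpha)$ is even. Thus, we can
write $N_{E/\mathbb{Q}}(\alpha) = \pm\prod_{p \in S} p^{\varepsilon_p}$ where $\varepsilon_p$ is odd for those $p$ for which $p = q_i$ for
some $i = 1, \ldots, s'$ and $p \neq q_j$ for any $j \neq i$, and $\varepsilon_p$ is even for all other primes
$p$. The element $\beta$ has the desired norm where $\beta = \gamma\alpha$ with $\gamma$ defined by

$$\gamma = \prod_{p \in S} p^{-v_p} \quad \text{with} \quad v_p = \begin{cases} (\varepsilon_p - 2)/2 & \text{if } p = q_i = q_j,\ i \neq j \\ (\varepsilon_p - 1)/2 & \text{if } \varepsilon_p \text{ is odd} \\ \varepsilon_p/2 & \text{otherwise.} \end{cases}$$

(ii) $\Rightarrow$ (i) Suppose $\beta \in E$ with $N_{E/\mathbb{Q}}(\beta) = \pm\prod_{i=1}^{s'} q_i$. Distinguish a set $U$
of primes by the following: If $p = q_i$ for some $i = s' + 1, \ldots, s$ and $\operatorname{ord}_{\mathfrak{Q}_i}(\beta)$ is
even then $p \in U$; if $p = q_i = q_j$ for $i \neq j$ and $\operatorname{ord}_{\mathfrak{Q}_i}(\beta)$ is even (and thus
$\operatorname{ord}_{\mathfrak{Q}_j}(\beta)$ is also even), then $p \in U$; otherwise $p \notin U$. The desired element $\alpha$
satisfying (**) is given by $\alpha = \tau\beta$ where

$$\tau = \prod_{p \in S} p^{v_p}, \qquad v_p = \begin{cases} 0 & \text{if } p \notin U \\ 1 & \text{if } p \in U. \end{cases}$$

(ii) $\Leftrightarrow$ (iii) Let $\gamma$ denote the element $\pm\prod_{i=1}^{s'} q_i$. Then $\gamma \in N_{E/\mathbb{Q}}(E)$ if and
only if $\gamma$ is represented over $\mathbb{Q}$ by the quadratic space $[1, -m]$, that is,
$[1, -m] \cong [\gamma, -\gamma m]$ over $\mathbb{Q}$. By the classification of rational quadratic forms this
isometry condition is equivalent to the equality of the local Hasse symbols
$S_p([1, -m]) = S_p([\gamma, -\gamma m])$ for all $p \in \Omega_{\mathbb{Q}}$. A calculation of these symbols
shows that this condition is in turn equivalent to the condition $(\gamma, m)_p = +1$
for all $p \in \Omega_{\mathbb{Q}}$.

(iii) $\Leftrightarrow$ (iv) These conditions are equivalent because $(\gamma, m)_p = 1$ for all
odd $p$ that do not divide either $m$ or $\gamma$. QED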
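
Condition (iii) of Proposition 1.6 is a finite computation once Hilbert symbols are available. The following is a minimal sketch for nonzero integers, using the standard explicit formulas for the symbol at odd primes, at 2, and at the real place (written here as `p = 0`, a convention chosen for this example). The function `is_norm_from_quadratic_field` tests only the places dividing $2\gamma m$ and the real place, in the spirit of condition (iv).

```python
def legendre(a, p):
    """Legendre symbol (a/p) for an odd prime p."""
    r = pow(a % p, (p - 1) // 2, p)
    return -1 if r == p - 1 else r


def split_off(n, p):
    """Write n = p^e * u with u prime to p; return (e, u)."""
    e = 0
    while n % p == 0:
        n //= p
        e += 1
    return e, n


def hilbert(a, b, p):
    """Hilbert symbol (a, b)_p for nonzero integers; p = 0 is the real place."""
    if p == 0:
        return -1 if a < 0 and b < 0 else 1
    alpha, u = split_off(a, p)
    beta, v = split_off(b, p)
    if p == 2:
        eps = lambda w: (w - 1) // 2 % 2
        omega = lambda w: (w * w - 1) // 8 % 2
        exponent = eps(u) * eps(v) + alpha * omega(v) + beta * omega(u)
        return -1 if exponent % 2 else 1
    sign = -1 if (alpha * beta * ((p - 1) // 2)) % 2 else 1
    return sign * legendre(u, p) ** beta * legendre(v, p) ** alpha


def prime_divisors(n):
    n, out, q = abs(n), set(), 2
    while q * q <= n:
        while n % q == 0:
            out.add(q)
            n //= q
        q += 1
    if n > 1:
        out.add(n)
    return out


def is_norm_from_quadratic_field(gamma, m):
    """True if (gamma, m)_p = 1 at the real place, at 2 and at primes dividing gamma*m."""
    places = {0, 2} | prime_divisors(gamma) | prime_divisors(m)
    return all(hilbert(gamma, m, p) == 1 for p in places)


print(is_norm_from_quadratic_field(2, 7), is_norm_from_quadratic_field(3, 7))
```

For $m = 7$ the value 2 passes the test, consistent with $N(3 + \sqrt{7}) = 2$, while 3 fails at the prime 7.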


For the purpose of the present problem, we consider those elements
satisfying the conditions (**) where the set $\mathfrak{Q}_1, \ldots, \mathfrak{Q}_s$ varies through all
possible subsets of the set $\mathcal{M}(\tilde{L})$. For each of these subsets obtain one
element of $E$ satisfying these conditions (if such an element does in fact exist);
let $\alpha_1, \ldots, \alpha_u$ be a list of these elements. Always choose $\alpha_i$ to have
$N_{E/\mathbb{Q}}(\alpha_i) > 0$ whenever possible and arrange the elements in such a way that
$N_{E/\mathbb{Q}}(\alpha_i) > 0$ for $i = 1, \ldots, u'$ and $N_{E/\mathbb{Q}}(\alpha_i) < 0$ if $i = u' + 1, \ldots, u$ (of course,
$u' = u$ if $m > 0$ with $N_{E/\mathbb{Q}}(\varepsilon) = -1$ or if $m < 0$). Our discussion can be
summarized in the following theorem. Recall that $p_1, \ldots, p_t$ denote those rational
primes that divide $\operatorname{disc}(E/\mathbb{Q})$.

Theorem 1.7 Let $L$ be a $\mathbb{Z}$-lattice with $\dim L \ge 3$ and $L_2$ modular. Let
$\bar{j} \neq \overline{(1)} \in \mathfrak{G}_{\mathbb{Q}}(L)$. $\psi_L(\bar{j}) = \overline{(1)} \in \mathfrak{G}_E(\tilde{L})$ if and only if there exists $\alpha \in E$ for which
$(\alpha)h(j) \in J_E^{\tilde{L}}$ and:

(i) If $L$ is definite, $m > 0$ and $N_{E/\mathbb{Q}}(\varepsilon) = +1$, $\alpha$ is a product of elements
from the list $1, \varepsilon, p_1, \ldots, p_t, \alpha_1, \ldots, \alpha_{u'}$.

(ii) If $L$ is definite, $m > 0$ and $N_{E/\mathbb{Q}}(\varepsilon) = -1$, $\alpha$ is a product of elements
from the list $1, p_1, \ldots, p_t, \alpha_1, \ldots, \alpha_{u'}$.

(iii) If $m < 0$ or if $L$ is indefinite, $\alpha$ is a product of the elements $1, \varepsilon, p_1, \ldots,
p_t, \alpha_1, \ldots, \alpha_u$.

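Example 1.8 Consider the lattice $L$ of Example 1.2. The map $\psi_L$ is
injective if and only if $\psi_L(\bar{j})$ is a nontrivial element of $\mathfrak{G}_E(\tilde{L})$. We consider
several extensions $E$.

(i) $E = \mathbb{Q}(\sqrt{3})$. We have $\psi_L(\bar{j}) = (2)\psi_L(\bar{j}) = \psi_L(\bar{k})$. Furthermore,
$\Delta_{19} \in \dot{E}_{\mathfrak{p}}^2$ for $\mathfrak{p} = 19\mathfrak{o}$, since 19 is inert in $E/\mathbb{Q}$. Thus, $\psi_L(\bar{k}) = \overline{(1)}$ and $\psi_L$ is
not injective.

(ii) $E = \mathbb{Q}(\sqrt{5})$. Since 3 is inert in $E/\mathbb{Q}$, we have $\Delta_3 \in \dot{E}_{\mathfrak{p}}^2$ for $\mathfrak{p} = 3\mathfrak{o}$
and $\psi_L(\bar{j}) = \overline{(1)}$; again $\psi_L$ is not injective.

(iii) $E = \mathbb{Q}(\sqrt{7})$. According to Theorem 1.7 we need only check the
multiples of $\psi_L(\bar{j})$ by products of the elements 1, 2, 7, $\varepsilon$, and $\beta$ where
$\varepsilon = 8 + 3\sqrt{7}$ and $N_{E/\mathbb{Q}}(\beta) = 2$. First consider $\varepsilon$. Since $\sqrt{7} \equiv \pm 1$ in the
residue class field $\mathbb{Z}/(3)$ of $\mathbb{Q}_3$ and $\sqrt{7} \equiv \pm 8$ in the residue class field $\mathbb{Z}/(19)$
of $\mathbb{Q}_{19}$, $\varepsilon$ is seen to be a nonsquare at
each of the spots lying above 3 and 19. So $(\varepsilon)\psi_L(\bar{j}) \neq \overline{(1)}$. We may choose
$\beta = 3 + \sqrt{7}$. Local calculation shows that $\beta$ is a square at one spot lying
over 3 and a nonsquare at the other. Thus, $(\beta)\psi_L(\bar{j}) \neq \overline{(1)}$. Hence, $\psi_L(\bar{j}) \neq \overline{(1)}$
and $\psi_L$ is injective.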
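
The local calculations in part (iii) reduce to residue computations, since a unit at an odd prime $p$ is a $p$-adic square exactly when its residue is a square modulo $p$. The sketch below (helper names are illustrative) evaluates $x + y\sqrt{7}$ at the two spots above a split odd prime by substituting the two square roots of 7 modulo $p$.

```python
def sqrt_mod(a, p):
    """All square roots of a modulo the odd prime p (brute force)."""
    return [r for r in range(p) if (r * r - a) % p == 0]


def is_square_mod(a, p):
    """Euler's criterion for a nonzero residue a."""
    return pow(a % p, (p - 1) // 2, p) == 1


def local_images(x, y, m, p):
    """Residues of x + y*sqrt(m) at the spots above a split odd prime p."""
    return [(x + y * r) % p for r in sqrt_mod(m, p)]


for p in (3, 19):
    print(p, [is_square_mod(v, p) for v in local_images(8, 3, 7, p)])
print(3, [is_square_mod(v, 3) for v in local_images(3, 1, 7, 3)])
```

Here $\varepsilon = 8 + 3\sqrt{7}$ corresponds to `(8, 3)` and $\beta = 3 + \sqrt{7}$ to `(3, 1)`.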

2. Spinor Norms in Quadratic Fields 

The problem of computing the spinor norms of local integral rotations
on a lattice $K$ at the spots $\mathfrak{p}$ of $\mathbb{Q}(\sqrt{m})$ has been completely solved (see
Kneser [11] and Earnest and Hsia [5]) unless $\mathfrak{p}$ is a ramified dyadic spot; i.e.,
$\mathfrak{p} \mid 2$ and $e(\mathfrak{p} \mid 2) = 2$. In this section we calculate these spinor norm groups
for the particular case of such localizations of a lattice $L$ lifted from $\mathbb{Q}$. Since
2 ramifies in $\mathbb{Q}(\sqrt{m})$, $m$ must be congruent to 2 or 3 modulo 4, and the
results here vary significantly depending upon the specific congruence. The
computations depend strongly upon the fact that the residue class field of
$\mathbb{Q}(\sqrt{m})$ at $\mathfrak{p}$ contains only two elements; in particular, any sum of two units
lies in the ideal $\mathfrak{p}$.

In this section, we adapt our notation to this purely local situation. A
dyadic prime spot on $\mathbb{Q}(\sqrt{m})$, $m \equiv 2$ or $3 \pmod 4$, is denoted by $\mathfrak{p}$ and $F$ is
the completion of $\mathbb{Q}(\sqrt{m})$ at $\mathfrak{p}$. $L$ will be a lattice over $\mathbb{Z}_2$, the integers of $\mathbb{Q}_2$,
and so $L$ is a lattice over $\mathfrak{o}$, the ring of integers of $F$. Since the spinor norm
groups are unaffected by scaling, we shall assume $\mathfrak{s}L = \mathbb{Z}_2$. The groups of units
of $\mathbb{Z}_2$ and $\mathfrak{o}$ are denoted by $\mathfrak{u}_2$ and $\mathfrak{u}_{\mathfrak{p}}$, respectively. Finally, $\pi$ denotes a fixed prime in
$F$; i.e., $\pi\mathfrak{o} = \mathfrak{p}$. Then $2 = \pi^2\beta$ for some $\beta \in \mathfrak{o}$. In the following proposition,
which makes clear the local distinction of the cases $m \equiv 2$ or $3 \pmod 4$, we
describe a choice of the element $\pi$ and examine the resulting $\beta$. The symbol
$\mathfrak{D}$ denotes the quadratic defect as described in O’Meara [12].


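Proposition 2.1 $\mathfrak{D}(2) = \mathfrak{p}^3 \iff m \equiv 3 \pmod 4$.

Proof If $m \equiv 2 \pmod 4$, write $m = 2m'$, $m'$ an odd integer. Let
$\pi = \sqrt{m}/m'$. Then $\pi^2 = m/m'^2 = 2m'/m'^2 = 2/m'$. So, $2 = \pi^2 m'$; that is,
$\beta = m' \in \mathfrak{u}_2$, so $\mathfrak{D}(\beta) \subseteq \mathfrak{p}^3$. Hence, $\mathfrak{D}(2) \subseteq \mathfrak{p}^5$ and it follows that $\mathfrak{D}(2) = 0$.

Conversely, if $m \equiv 3 \pmod 4$, then write $m = 3 + 4t$ and let $\alpha = 2 +
2t - \sqrt{m}$. This $\alpha \in \mathfrak{o}$ and a direct calculation shows $N_{F/\mathbb{Q}_2}(\alpha) \in \mathfrak{u}_2$ and it
follows that $\alpha \in \mathfrak{u}_{\mathfrak{p}}$. Choose $\pi = (\sqrt{m} - 1)/\alpha$. Then $2 = \pi^2\alpha$. Furthermore,
$\alpha = 1 + (1 + 2t - \sqrt{m})$ and $N_{F/\mathbb{Q}_2}(1 + 2t - \sqrt{m})$ is a prime element of $\mathbb{Z}_2$,
so $1 + 2t - \sqrt{m}$ is a prime element of $\mathfrak{o}$. In this case, $\mathfrak{D}(\alpha) = \mathfrak{p}$ and $2 = \pi^2\alpha$
gives $\mathfrak{D}(2) = \mathfrak{p}^3$. QED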
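
The "direct calculation" in the case $m \equiv 3 \pmod 4$ is pure algebra in $\mathbb{Z}[\sqrt{m}]$ and can be confirmed mechanically. The sketch below is only an illustration under an assumed representation (elements $a + b\sqrt{m}$ stored as integer pairs, a choice not taken from the source); it checks that $N(\alpha) = (1 + 2t)^2$, that $(\sqrt{m} - 1)^2 = 2\alpha$, and that $N(1 + 2t - \sqrt{m})$ is exactly twice an odd integer.

```python
def mul(u, v, m):
    """Multiply a + b*sqrt(m) and c + d*sqrt(m), stored as pairs (a, b), (c, d)."""
    (a, b), (c, d) = u, v
    return (a * c + m * b * d, a * d + b * c)


def norm(u, m):
    """Norm of a + b*sqrt(m) down to the rationals: a^2 - m*b^2."""
    a, b = u
    return a * a - m * b * b


def check(t):
    """Verify the identities used in the proof for m = 3 + 4t."""
    m = 3 + 4 * t
    alpha = (2 + 2 * t, -1)              # alpha = 2 + 2t - sqrt(m)
    root_minus_one = (-1, 1)             # sqrt(m) - 1
    square = mul(root_minus_one, root_minus_one, m)
    n_shifted = norm((1 + 2 * t, -1), m) # norm of alpha - 1
    return (
        norm(alpha, m) == (1 + 2 * t) ** 2            # an odd square, so a 2-adic unit
        and square == (2 * alpha[0], 2 * alpha[1])    # (sqrt(m) - 1)^2 = 2 * alpha
        and n_shifted % 2 == 0
        and (n_shifted // 2) % 2 == 1                 # exactly one factor of 2
    )


print(all(check(t) for t in range(-50, 51)))
```

The identity $(\sqrt{m} - 1)^2 = 2\alpha$ is the same statement as $2 = \pi^2\alpha$ with $\pi = (\sqrt{m} - 1)/\alpha$.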


The next proposition reduces the problem of calculating spinor norms of
all elements of $O^+(L)$ to that of performing the calculations only for the
symmetries in $O(L)$.

Proposition 2.2 $O(L)$ is generated by the symmetries $S_x$ and Eichler
transformations $E_w$ in $O(L)$.

Proof Apply induction on $\dim L$. It suffices to consider those $L$ with
$\mathfrak{s}L = \mathbb{Z}_2$. For such a lattice, $L = L_1 \perp M$ with $L_1$ unimodular and
$\mathfrak{s}M \subseteq 2\mathbb{Z}_2$. If $L_1$ is split by a hyperbolic plane the result follows as in
O’Meara and Pollak [13, 14]. Since the orders of norm and weight generators
of $L_1$ are both even, $L_1$ is split in that way whenever $\dim L_1 \geq 3$. So we need
only consider $\dim L_1 \leq 2$. Furthermore, if $L_1 \cong A(2, 2)$ or $L_1$ unary, the
result follows as in the proof of Proposition 2.1 of Earnest and Hsia [7].

Suppose $L_1 \cong A(\varepsilon, 2\delta)$, with $\varepsilon, \delta \in \mathfrak{u}_2$, in basis $\{x, y\}$ and $\sigma \in O(L)$. Then
at least one of the symmetries $S_{\sigma(x) - x}$ or $S_{S_y\sigma(x) - x}$ lies in $O(L)$ and the result
follows by induction since $\mathfrak{o}x$ splits $L$ and there is a $z$ with $S_z \in O(L)$ for
which $S_z\sigma$ or $S_zS_y\sigma$ fixes $x$. If $L_1 \cong A(1, 0)$ or $A(1, 4)$ in basis $\{x, y\}$, then one
of $S_{\sigma(x) - x}$ or $S_{S_{\pi x + y}\sigma(x) - x}$ lies in $O(L)$ and the result follows similarly. This
exhausts all possibilities. QED


We need one further calculation in order to simplify the statements of 
later results. 


Lemma 2.3 Let $d \in \mathfrak{u}_2$. Then the $F$-space $[1, d]$ represents all units $h$ that
lie in $1 + \mathfrak{p}^2$.

Proof The quadratic defect $\mathfrak{D}(-d)$ is contained in $\mathfrak{p}^3$. If $\mathfrak{D}(-d) \subseteq 4\mathfrak{o}$,
then the result follows since the Hilbert symbol $(h, -d)_{\mathfrak{p}} = +1$ for any
$h \in \mathfrak{u}_{\mathfrak{p}}$. So we treat the case $\mathfrak{D}(-d) = \mathfrak{p}^3$. Write $-d = 1 + \pi^3\varepsilon$, $\varepsilon \in \mathfrak{u}_{\mathfrak{p}}$. A
basis $\{x, y\}$ for the space can be chosen for which the corresponding matrix is

$$\begin{pmatrix} 1 & 1 \\ 1 & -\pi^3\varepsilon \end{pmatrix}.$$

We shall show there is a vector $v = x + Cy$ with $C \in F$ for which $Q(v) = h$.
By perfectness of the residue class field, $h$ can be written as $h = 1 + 2r$, $r \in \mathfrak{p}$.
Now $Q(v) = 1 + 2C - \pi^3\varepsilon C^2 = 1 + 2(C + \pi\varepsilon'C^2)$, $\varepsilon' \in \mathfrak{u}_{\mathfrak{p}}$. The polynomial
$f(X) = \pi\varepsilon'X^2 + X - r$ is reducible by the reducibility criterion of 13:9 [12].
That is, $f(X)$ has a linear factor, and thus a root in $F$. Taking $C$ equal to this
root gives the desired vector. QED

We now proceed with specific calculation of the spinor norm groups.
These groups were computed in Hsia [9] for modular lattices with no restrictions
on the nature of the local field. From results obtained there, it can be
seen that $\theta(O^+(L)) = \mathfrak{u}_{\mathfrak{p}}\dot{F}^2$ whenever $\dim L \geq 2$ and $L$ is modular but
$L \not\cong A(\varepsilon, 2\delta)$ where $\varepsilon, \delta \in \mathfrak{u}_2$. Proposition E of Hsia [9] applies to this exceptional
case, but does not yield a solution in closed form. However, in the
present context the situation is seen to be considerably simpler.

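Proposition 2.4 Let $L \cong A(1, 2\gamma)$, $\gamma \in \mathfrak{u}_2$.

(i) If $m \equiv 2 \pmod 4$, then $\theta(O^+(L)) = (1 + \mathfrak{p}^2)\dot{F}^2$.

(ii) If $m \equiv 3 \pmod 4$, then $\theta(O^+(L)) = \mathfrak{u}_{\mathfrak{p}}\dot{F}^2$.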

Proof We first show that (1 + p 2 )F 2 £ 0(O + (L)) in any case. The lattice 
L can be rewritten over Z 2 as L £ < 1) _L (dy, d — det L e u 2 . Let 
h = 1 + 2t, t e o p . Then g<fx, h _1 d) = ho 2 + 2o p = o 2 + 2o„ = 0 < 1 , d > and 
the spaces [h, h _1 d] and [1, d] are isometric over F by Lemma 2.3, so 
(h, h~ l d) and <1, d) are isometric over o by 93 : 16 of [12]. In particular, 
(1, d) represents h over o. Since he u p , the vector of L of length h gives rise 
to a symmetry of L 

Next, suppose v = Ax + Cy e P(L), where {x, y} is a basis adapted to 
A(l, 2y). Further, write 2y = n 2 f noting that / e u 2 when ms 2 (mod 4) and 
Q>(f) = p when ms 3 (mod 4). If A e u p , Q( v) is clearly an element of 
(1 + p 2 )u 2 . So assume A e p and, without loss of generality, C = 1. Thus, 
Q(v) = A 2 + n 2 f If ord p A = 1, then Q(v) e p 3 ; but B(v, x) e u„. So no such 
v can give rise to an integral symmetry. If ord„ A >2, it is clear that 
ordp Q(v) = ord p ( 7 t 2 /) = 2 and, thus, we get 0(O + (L)) £ u p F 2 . In the case 
ms 2 (mod 4), this containment can be strengthened because Q(v) = A 2 + 
n 2 f = 7 t 2 /(1 + A 2 n~ 2 f ~') and ord p A >2 shows that 

1 + >l 2 7 r " 2 / _1 e 1 + p 2 , 

but furthermore ,/e 1 + p 2 since f e u 2 . So Q(v) e (1 + p 2 )u 2 and we obtain 
the reverse containment, (1 + p 2 )F 2 £ 0(0 + (L)). However, when 
$m \equiv 3 \pmod 4$, $Q(v) \notin (1 + \mathfrak{p}^2)\dot{F}^2$ for the vector $v = 2x + y \in P(L)$. So in
that case, $\theta(O^+(L)) = \mathfrak{u}_{\mathfrak{p}}\dot{F}^2$. QED

Remark 2.5 In the present context, $1 + \mathfrak{p}^2 = \mathfrak{u}_2(1 + \mathfrak{p}^2)$ is a subgroup
of $\mathfrak{u}_{\mathfrak{p}}$ of index two.

Hence we see that the spinor norm group $\theta(O^+(L))$ has a very convenient
form when $L$ is modular. To compute the general nonmodular case, we will
be able to determine the spinor norms using the components of a Jordan
splitting and sublattices of the form $\langle 1 \rangle \perp \langle 2^r\gamma \rangle$, $r \geq 1$ and $\gamma \in \mathfrak{u}_2$. We now
turn our attention to these binary nonmodular lattices.

Proposition 2.6 Let $m \equiv 2 \pmod 4$ and $L \cong \langle 1, 2\gamma \rangle$, $\gamma \in \mathfrak{u}_2$. Then
$\theta(O^+(L)) = (1 + \mathfrak{p}^2)\dot{F}^2$.

Proof Let {x, y} be a basis for <1, 2y>. Write 2y = n 2 f where, since 
m = 2 (mod 4),/can be chosen from u 2 . In particular,/ = 1 + 2g, g g u 2 . 

To show the containment take v = Ax + Cy e P(L), A, C s o. If 
A e u p , C g p it is clear that Q(v) e (1 + p 2 )P 2 . If both A e n p and C e u p , 
assume ,4 = 1 and write Q(v) = 1 + n 2 C 2 f = 1 + n 2 C 2 + 2n 2 C 2 g = 
(1 -h nC) 2 + 2n 2 C 2 g — 2nC e (1 + p 2 )F 2 . When A e p, assume C = 1; then 
if ord p A > 2, Q(v) = 7 t 2 /(1 + ^ 2 7r" 2 / _1 ) e (1 4- p 2 )P 2 since/e 1 + p 2 . The 
case C = 1, ord p /4 = 1 does not give a symmetry in 0(L). This exhausts all 
possibilities. 

Conversely, if h e 1 + p 2 , then <l,/> ^ </i, ft _1 /> by Lemma 2.3 and 
93 : 16 [12]. So there are s, t e o for which h = s 2 + t 2 f If t e u p the vector 
v = nsx + ty of L has length 7i 2 /z. If t e p, the vector v = sx + t7i -1 y has 
length h. QED 

Proposition 2.7 Let $m \equiv 3 \pmod 4$ and $L \cong \langle 1, 2\gamma \rangle$, $\gamma \in \mathfrak{u}_2$. Then
$\theta(O^+(L)) = \{\eta \in \dot{F} : (\eta, -f)_{\mathfrak{p}} = +1\}$ where $2\gamma = \pi^2 f$.

Proo/ Consider the lattice if ^ <!,/> over o in basis [x, w}. The lattice 
with basis {x, nw} is isometric to L, so we may view Las a sublattice of if. In 
particular, 6(0 + (L)) ^ 0(0 + (if)). Conversely, suppose 

v = Ax + Cw g P(if). 

If A e u p , C g p, say C = 7t(, then u = Ax + t,y e P(L) with Q(u) = Q(v). If 
C g u p , assume C = 1 and consider the vector u = ^7rx -f y g L. Then 
Q(w) = 7r 2 Q(i;) and it remains only to verify that S u e 0(L). When A e p, 
Q(u) g 2Up and S u e 0(L). If A e u p , then we observe that @(f) = p since 
m = 3 (mod 4) and that |6(^)| = \A 2 +f\ — \n\. Furthermore, 
2 B(u, L) = p 3 = Q(u )o and S u g 0(L) in this case also. Thus, we have shown 
that 6(0 + (L)) = 6(0 + (if)). But the group on the right can be determined by 
Proposition B of Hsia [9] and is seen to equal the set of elements satisfying 
the Hilbert symbol (i y \, —f) p = + 1. QED 
Remark 2.8 In the case described by the above proposition, $\theta(O^+(L))$
has index two in $\dot{F}$. This is the first instance in which we do not have
$\theta(O^+(L)) \subseteq \mathfrak{u}_{\mathfrak{p}}\dot{F}^2$. It will be seen (Lemma 2.19) that this is the only way in
which this containment can be violated.

The calculation of $\theta(O^+(L))$ when $L \cong \langle 1, 4\gamma \rangle$, $\gamma \in \mathfrak{u}_2$, has been done in
Proposition 3.1 of [7]. Here there is no distinction between $m \equiv 2$ or $3
\pmod 4$ and, combining this result with Lemma 2.3, we obtain:

Proposition 2.9 Let $L \cong \langle 1, 4\gamma \rangle$, $\gamma \in \mathfrak{u}_2$. Then $\theta(O^+(L)) = (1 + \mathfrak{p}^2)\dot{F}^2$.

Additionally, the cases $L \cong \langle 1, 2^r\gamma \rangle$, $\gamma \in \mathfrak{u}_2$, $r \geq 4$ are determined by 3.3
and 3.4 of [7]. So the binary cases will be completely resolved with
the calculation of $\theta(O^+(L))$ for $L \cong \langle 1, 8\gamma \rangle$. Again in this case, we
treat separately $m \equiv 2 \pmod 4$ and $m \equiv 3 \pmod 4$. In either case, if
$\{x, y\}$ is a basis for the given splitting and $v = Ax + Cy \in P(L)$, restrictions
on the possible values of $A$ and $C$ are given by Proposition 3.2 of [7]. From
these restrictions, it can be easily seen that $\theta(O^+(L)) \subseteq \mathfrak{u}_{\mathfrak{p}}\dot{F}^2$.

Proposition 2.10 Let $m \equiv 2 \pmod 4$ and $L \cong \langle 1, 8\gamma \rangle$. Then
$\theta(O^+(L)) = (1 + \mathfrak{p}^2)\dot{F}^2$.

Proof It is easy to see that (1 + p 4 )F 2 £ 0(O + (L)) £ (1 + p 2 )F 2 by 
using Proposition 3.2 [7]. So it will suffice to show that for any t = 1 4- 27te 
with £ e u„, there is v e P(L) with Q(v) e xF 2 . We shall show there is such a 
v = Ax + Cy with C e u p and A = 2a, a e u p . 

Since m = 2 (mod 4) we can write 8y = 4n 2 f where fe u 2 . In particular, 

g i + p 3 and C 2 6 1 + p 2 gives -C 2 n 2 f = n 2 (l + 2s) for some s 6 o. 
Consider 


x — C 2 n 2 f = 1 + 2ti£ — C 2 n 2 f = (1 + 7t) 2 


. 2tt((£ - 1) + Tts) 
(1 + n ) 2 


If £ i 1 + p 2 , then by taking C = 1 + n we see that s e u p and the expression 
in square brackets is of the form 1 + 4izr\, rj e o and is thus a square by the 
local square theorem; say (1 + 7t) 2 (l + 4nr/) = a 2 , a e u p . Then the vector 
v = 2ax + (1 + n)y has length Q(v) = 4a 2 + (1 + 7r 2 )47i 2 / = 4(a 2 + (1 + 
n 2 )fn 2 ) = 4(t - (1 + n) 2 n 2 f + (1 + n) 2 n 2 f) = 4t e xF 2 . On the other hand, 
if eg 1 + p 2 , then taking C = 1 gives se p and again the expression in square 
brackets is a square; say a 2 = (1 + rc) 2 (l + 4nri) again. This time v = 2ax + y 
is the desired vector. QED 


Proposition 2.11 Let $m \equiv 3 \pmod 4$ and $L \cong \langle 1, 8\gamma \rangle$, $\gamma \in \mathfrak{u}_2$. Then
$\theta(O^+(L)) = \{\eta \in \mathfrak{u}_{\mathfrak{p}} : (\eta, -f)_{\mathfrak{p}} = +1\}\dot{F}^2$ where $8\gamma = 4\pi^2 f$.

Proof We again view $L$ as a sublattice of $\mathscr{L} \cong \langle 1, f \rangle$ in basis $\{x, w\}$, with $y = 2\pi w$.
It suffices to show that 6(0 + (L)) = (u p n Q(P(^)))F 2 . Let u = Ax + 
Cy e P(£). If A e u„, then Q(u) e F 2 . If C e u„ we may take C = 1 and 
restrictions on the value of A are given by 3.2 of [7]. For 0 < ord p A < 2, 
the vector v = x + 2nA~ x w lies in P(i?) and has length A~ 2 A(u). For 
ord p A > 4, the vector v = (A/2n)x + w lies in P( <£) and has length 
(\l2n) 2 Q(u). This shows 0(0 + (L)) ^ (u p u Q(P(j£?)))F 2 . 

Conversely, let v = Ax A Cw g P(£P), Q(v) e u p . If Cep 3 , write 
C — 2nC 0 , C 0 e o p , and consider the vector u = Ax + C 0 y. This vector has 
length Q(v) and lies in P(L). If 0 < ord p C < 2, then 27 tC _ 1 e u p and 
v = 2nC~ l x + y e P(L) has length Q(u) = (2nC~ l ) 2 Q(v). So we are left with 
C e u p ; take C = 1. The vector u = 2nAx + y has length Q(u) = (2n) 2 x 
(A 2 +/) = (27r) 2 Q(t>). When A e p, 2nA e p 4 and so u e P(L). Finally, if 
^4 e u p , then \Q(v)\ = \n\ since @(f) = p, and Q(r) does not lie in 
Q(P(J?)) n u p )F 2 . QED 

We turn now to the question of determining $\theta(O^+(L))$ for a general $L$ by
considering a Jordan splitting of $L$. The results vary markedly depending
upon the congruence of $m$ mod 4. We treat first the case $m \equiv 2 \pmod 4$. In
this case, we have observed for binary $L$ that $\theta(O^+(L)) \subseteq \mathfrak{u}_{\mathfrak{p}}\dot{F}^2$. This result
can be extended to lattices of arbitrary dimension by an inductive argument
and a consideration of all possibilities for the structure of the first Jordan
component of a splitting of the given lattice. The result is stated here without
proof.

Lemma 2.12 If $m \equiv 2 \pmod 4$, then $\theta(O^+(L)) \subseteq \mathfrak{u}_{\mathfrak{p}}\dot{F}^2$.

This lemma has several immediate consequences.

Theorem 2.13 Let $m \equiv 2 \pmod 4$.

(i) If $L$ has a Jordan component of dimension 3 or more, then
$\theta(O^+(L)) = \mathfrak{u}_{\mathfrak{p}}\dot{F}^2$.

(ii) If $L$ has a binary Jordan component which is not of the form
$2^rA(\varepsilon, 2\delta)$, $r \in \mathbb{N} \cup \{0\}$, $\varepsilon, \delta \in \mathfrak{u}_2$, then $\theta(O^+(L)) = \mathfrak{u}_{\mathfrak{p}}\dot{F}^2$.

So we may now assume that $L$ has a Jordan splitting
$L = L_1 \perp 2^{r_2}L_2 \perp \cdots \perp 2^{r_t}L_t$ with $\dim L_i \leq 2$ and all binary $L_i$ have the
form $A(\varepsilon, 2\delta)$, $\varepsilon, \delta \in \mathfrak{u}_2$.

We first treat the case when all $L_i$ are unary.

Theorem 2.14 Let $m \equiv 2 \pmod 4$ and let

$$L = \mathbb{Z}_2x_1 \perp \cdots \perp \mathbb{Z}_2x_t \cong \langle 2^{r_1}\varepsilon_1 \rangle \perp \langle 2^{r_2}\varepsilon_2 \rangle \perp \cdots \perp \langle 2^{r_t}\varepsilon_t \rangle$$

be a Jordan splitting of $L$. Then

Proof This result has already been obtained in 3.8 [7] unless there are 
indices k, s, and t for which r k+ j — r k = 1 or 3 and r s — r, = 2 or 4. For such 
lattices we have established that 

0(O + (L)) 2 (1 + p 2 )F 2 = e(0 + ( ox k + ox t+1 )). 

So it suffices to show that 0(0 + (L)) = (1 + p 2 )F 2 . For any vector 
v = £•=, A i x i e F(L), the condition m = 2 (mod 4) assures that there is 
exactly one index i = i 0 for which AfQ(x t ) has maximal value. But since each 
AfQ(xi) has even order, it follows that 

Q(v) = Q(A i0 X i0 ) + X e Q(A h X io )(l + p 2 ) 

i + io 

<=(l + p 2 )£(l + p 2 )F 2 . QED 

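We now treat the case when there is at least one binary component and
all binary $L_i$ have the form $A(\varepsilon, 2\delta)$, $\varepsilon, \delta \in \mathfrak{u}_2$. By Proposition 2.4, $\theta(O^+(L))$
contains the subgroup $(1 + \mathfrak{p}^2)\dot{F}^2$ of index two in $\mathfrak{u}_{\mathfrak{p}}\dot{F}^2$; thus, $\theta(O^+(L))$ must
equal either $(1 + \mathfrak{p}^2)\dot{F}^2$ or $\mathfrak{u}_{\mathfrak{p}}\dot{F}^2$. If the components are not too closely
bunched, only $(1 + \mathfrak{p}^2)\dot{F}^2$ is attained.

Theorem 2.15 Assume that $m \equiv 2 \pmod 4$ and $L$ has a Jordan splitting
of the type described above. Then $\theta(O^+(L)) = (1 + \mathfrak{p}^2)\dot{F}^2$ unless $L$ is split by a
sublattice $2^rK$ of the form $K \cong \langle \varepsilon \rangle \perp 2A(\eta, 2\rho)$ or $K \cong A(\eta, 2\rho) \perp 2\langle \varepsilon \rangle$. In
this case, $\theta(O^+(L)) = \mathfrak{u}_{\mathfrak{p}}\dot{F}^2$.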

Proof Suppose L contains a sublattice 2 r K of one of the given types in 
basis {x, y , z}. If K = <e> 1 2A(rp 2p) the vector v = 2x + ny + z e P(K) 
has Q(v) £ (1 4- p 2 )F 2 . Since 0(O + (A(r ], 2p))) = (1 + p 2 )F 2 has index two in 
u p F 2 , 0(O + (L)) must be all of u p F 2 . 

To show that in all other cases 0(0 +(L)) = (1 4- p 2 )F 2 , apply induction 
on the number of Jordan components making use of the following lemma: 

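Lemma 2.16 Suppose $L = K \perp M$ where $K \cong A(\eta, 2\rho)$ or $K \cong \langle \varepsilon \rangle$ and
$\mathfrak{s}M \subseteq 4\mathbb{Z}_2$. If $\theta(O^+(M)) \subseteq (1 + \mathfrak{p}^2)\dot{F}^2$, then $\theta(O^+(L)) \subseteq (1 + \mathfrak{p}^2)\dot{F}^2$.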

Proof We shall write only the case K = A(rj , 2p) since the argument for 
K = <e> is contained therein. For any v e P(L), write v = Ax + Cy + z 
where {x, y} is a basis for X, A, C e o and z e M. If ord p A < 1, then 
Q(z) e Q(Ax + Cy)(l -h p 2 ). When ord p A > 2, v e P(L) forces either C e u p 
or both A and C lie in p 3 . In the latter case, if Q(z)e4u p , then 
Q(Ax + Cy) e Q(z)( 1 + p 2 ). We are left with the case Q(z) e 8o and 
| Q( v ) | = | Q( z ) |. If ord p Q(z) = U the condition v e P(L) gives A, C e p x ~ 2 . 
So A 2 , C 2 , and AC all lie inside p 2f_4 and, since t > 6, we have 2t - 4 > 
t + 2. Thus, Q(v) e Q(z)(l + p 2 ). This exhausts all possibilities and the 
lemma is proved. QED 
We now complete the case $m \equiv 3 \pmod 4$. Theorem 2.14 remains
true also in this case.

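Theorem 2.17 Let $m \equiv 3 \pmod 4$ and let $L = \mathbb{Z}_2x_1 \perp \cdots \perp \mathbb{Z}_2x_t \cong
\langle 2^{r_1}\varepsilon_1 \rangle \perp \cdots \perp \langle 2^{r_t}\varepsilon_t \rangle$ be a Jordan splitting of $L$. Then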

{ even | 

Q(v ): v e P(oxj ± ox j+l ), 1 <j < t - l|. 

Proof The result follows from Theorem 3.8 of Earnest and Hsia [7] 
unless there are indices k, s, and t for which r k + x — r k = 1 or 3 and 
r s — r t = 2 or 4. If r k+ x — r k = 1 and r s — r t = 2, then 0(0 + (L)) = F because 
0(O + (ox s 1 ox,)) = (1 + p 2 )F 2 £ 0(0 + (ox k _L ox k + x)). This follows from 
the duality lemma of Ankeny and Hsia [1] which shows that for —/ with 
@( —/) = p there is an element p with <2>(p) = p 3 for which (//, —/) p = — 1. 
So we may assume there are no such indices. 

For a vector v = £S =1 AfX, e P(L), let k be the largest index for which 
ord p Q(A k X k ) is minimal. If there is another j < k with ord p Q(A k X k ) = 
ord p Q(AjXj), it follows easily that j = k — 1, r } = r k — 1, e u p , and 
\Aj \ = 17T |. Since r s — r t ^ 2 for any s, t in this case, we have r k — r h > 4 
for h < k — \ and r h — r k > 3 for h > k and it follows that 
<2(t>) g + v4 fc AT k )/’ 2 . So assume minimality occurs only at the 

index /c. Then \A k \> \2\ since S v e O(L). Suppose A k eu p and that 
r k + 1 — r k = 1. We treat here the case r fc+1 = r k + 1. If j > k + 1, then 
rj — r k > 4 and Q(A; X ; ) e 8Q(t;)o and similarly if j < k with r fc — > 4. If 

»•* = »•*-1 + 3 and 1 e4u,,,wehave = |2| | Q(v )| and a 

calculation shows that 

Q(Ak- 1 X k ~ i + A k+1 X k+1 ) = A k + 1 2 n + 1 (a k - x e 2 + a k+1 ) 

where £ g u p . The last factor, being a sum of units of defect ^ p 3 , lies in 2o. 
That is, Q(A k _ 1 X k _ 1 -f A k+l X k+l ) g 4A(v)o. On the other hand, if 
A+i e p, then C(A+i ^ k +i) e 4Q(i?)o and Q(v) e Q(A k . x X k . x + A k X k )x 
(1 + 4o) and the result follows. Keeping the assumption r fc+1 — r k = 1, the 
other possibilities | A k | = \n\ and \A k \= \2\ are resolved in a similar 
manner. 

The above argument shows that when there is no index k with 
r k+i ~ r k = 1 we have Q(v) e u p F 2 for all v e P(L). If this is the case and 
there are indices /, r, and s with r l+1 — r t = 3 and r t — r s = 2 the result again 
follows since 0(O + (L)) = u p F 2 . So to complete the proof we need only verify 
the result for a v e P(L) whose value equals \Q(A k X k )\ and r k — r k _ t >3 
and r k+l — r k > 3. This can be easily done using an argument analogous to 
the one given above. QED 

When $L$ does not have the diagonal decomposition discussed above,
$\theta(O^+(L))$ contains $\mathfrak{u}_{\mathfrak{p}}\dot{F}^2$ since this is the spinor norm group for any modular
component of dimension greater than one. The exact value is then calculated
by the following theorems.

Theorem 2.18 Let $m \equiv 3 \pmod 4$ and let $L$ have a Jordan component of
dimension greater than one. Then $\theta(O^+(L)) = \mathfrak{u}_{\mathfrak{p}}\dot{F}^2$ if and only if $L$ is not split
by a sublattice $2^r\langle \varepsilon, 2\gamma \rangle$, $\varepsilon, \gamma \in \mathfrak{u}_2$. If $L$ is split in this way, then $\theta(O^+(L)) = \dot{F}$.

The second statement is clear since $\theta(O^+(\langle \varepsilon, 2\gamma \rangle))$ has index two in $\dot{F}$.
The first follows immediately from the following lemma, which is proved by
induction on the number of Jordan components in a splitting of $L$.

Lemma 2.19 If $L$ is not split by a sublattice of the type $2^r\langle \varepsilon, 2\gamma \rangle$, then
$\theta(O^+(L)) \subseteq \mathfrak{u}_{\mathfrak{p}}\dot{F}^2$.


3. Applications 

In Earnest and Hsia [6] a construction was given that produced a lattice
$L$ having arbitrary dimension $d$ and $g^+(L) = 2^n$ for any prescribed integer $n$.
Now, additionally, let $k$ be any positive integer with $k < n$. We now give an
alternative construction of a lattice $L$ having both the properties above but
also for which an infinite number of extensions $E = \mathbb{Q}(\sqrt{m})$ can be constructed
so that $|\operatorname{Ker}(\psi_L)| = 2^k$ in each of these extensions.

Example 3.1 Let q u q n be distinct odd primes for which the 
Legendre symbol (2/^) =1 for i = 1, ..., n. The lattice L= <1, f 2 , ..., 
pi(d— i)) di mens j on d and g + (L) = 2”, as can be easily calculated using 
the technique described in Section 1. In particular, the primes q { may be 
chosen so that q x = 1 (mod 8) for each i = 1, ..., n. 

Proposition 3.2 There are infinitely many extensions $E = \mathbb{Q}(\sqrt{m})$ for
which $|\operatorname{Ker}(\psi_L)| = 2^k$ where the lattice $L$ is constructed as above.

Proof Let $p$ be a prime that satisfies:

(i) $p \equiv 3 \pmod 8$;

(ii) $(p/q_i) = -1$ for $i = 1, \ldots, k$;

(iii) $(p/q_i) = +1$ for $i = k + 1, \ldots, n$.

We claim that when $E = \mathbb{Q}(\sqrt{-p})$, $|\operatorname{Ker}(\psi_L)| = 2^k$. First note that

— p e A qi Q%. for i — 1, ..., k. So the ideles j(i) defined by 

all have the property that ^ L (j(i)))=( I) for i = 1 , ..., k; hence 
|Ker(^ L )| > 2 k . It remains only to show that no element j = YlUk+i j(i) £i * 


<(i)) q = 


if q = q x 
otherwise 
£j e {0, 1}, becomes trivial in ^ E (L). By Theorem 1.7, j e Ker(i// L ) if and only 
if h(j) e yJ E where y is a product of the elements — 1, 2, and p. Since 
qi = 1 (mod 8), — 1 and 2 are squares in and the choice of p assures 
p e (ft 2 for i = k + 1,..., n. Thus, | Ker(t// L ) ( = 2*. 

The $p$ used above can be chosen to be any prime that is a solution of the
linear congruence system

$$\begin{aligned}
Y &\equiv 3 \pmod 8 \\
Y &\equiv t_1 \pmod{q_1} \\
&\;\;\vdots \\
Y &\equiv t_n \pmod{q_n}
\end{aligned}$$

where $t_i \in \mathbb{Z}$ satisfy $(t_i/q_i) = -1$, $i = 1, \ldots, k$ and $(t_i/q_i) = +1$ for $i = k + 1,
\ldots, n$. QED
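
A prime $p$ with the three properties required in the proof can be found by direct search. The following is a small sketch under stated assumptions: the primes $q_i = 17, 41$ (both $\equiv 1 \pmod 8$) and the choice $k = 1$ are illustrative values not taken from the source, and the function names are hypothetical. It searches the progression $3, 11, 19, \ldots$ rather than solving the congruence system explicitly; Dirichlet's theorem guarantees the search terminates.

```python
from itertools import count


def is_prime(n):
    """Trial division; adequate for the small numbers used here."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True


def legendre(a, q):
    """Legendre symbol (a/q) for an odd prime q, via Euler's criterion."""
    r = pow(a % q, (q - 1) // 2, q)
    return -1 if r == q - 1 else r


def find_p(qs, k):
    """Smallest prime p = 3 (mod 8) with (p/q_i) = -1 for i <= k, +1 for i > k."""
    for p in count(3, 8):
        if not is_prime(p):
            continue
        signs = [legendre(p, q) for q in qs]
        if all(s == -1 for s in signs[:k]) and all(s == 1 for s in signs[k:]):
            return p


qs = [17, 41]            # illustrative primes, both congruent to 1 (mod 8)
p = find_p(qs, k=1)
print(p, [legendre(p, q) for q in qs])
```

Any further prime in the same residue classes modulo $8 q_1 \cdots q_n$ works equally well, which is the source of the infinitely many extensions.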


Remark 3.3 For the lattices L and extensions E constructed above, it 
can be shown that, although the map \j/ L is not injective, the number g + (L) is 
at least as large as g + (L). The next example will show that the value of g + 
may decrease in passing from Q to E. 


Example 3.4 Consider the space V = [1, 1, — 1, — 1] over Q. Let K be 
the Z-lattice on V given by K S < — 1, — 5 2 , 5 4 , 5 6 > and let J 2 be the lattice 
on V 2 given by 


Ji 


0 1 
1 0 


1 


0 1 
1 0 


By 81: 14 of [12], there is a lattice L on V for which L p = K p for p odd and 
L 2 = J 2 • This lattice L has g + (L) = 2 but g + (L)— 1. 

By using the machinery established in Earnest and Hsia [6, 7], and 
Section 2, sufficient conditions in terms of the lattice L and the extension E, 
for the number g + to be nondecreasing in passing from Q to E = 
can be obtained. Another group homomorphism q can be constructed, 
q: & q (L) -» & e (L), and shown to be injective under the given condition. 
Details of the construction can be found in Earnest [2], Let Sf(E) denote the 
set of all rational primes that divide disc(E/G). 

Theorem 3.5 Let Lbe a Z-lattice, dim L > 3. Then 
<T(L) n Sf(L) = 0^g + (L)> g + (L). 


Corollary 3.6 If no rational prime divides both disc(£/0) and det L, then 
g + (£)>g + (L). 

The computations in Section 2 can be used to generalize to quadratic extensions
the results of Kneser [11] and Earnest and Hsia [5] giving sufficient
conditions, in terms of the reduced determinant of a $\mathbb{Z}$-lattice $L$, for the
number $g^+(L)$ to be one. We obtain here a somewhat weaker condition
which guarantees only that $g^+(L) = 1$ when $L$ is lifted into a quadratic
extension with odd ideal class number.

Notation used here will conform to that introduced in Earnest and Hsia
[5]. In particular, define $d'L$ to be $d'L = \det(L^{a^{-1}})$ where $\mathfrak{n}L = a\mathbb{Z}$. If $\delta(L)$
denotes the reduced determinant defined in Eichler [8], $d'L$ is related to $\delta(L)$ by
the formula $\delta(L) = 2^n(d'L)\mathbb{Z}$. Note in particular that, in contrast to $\delta(L)$, $d'L$
is not always an integer but may be rational with 2-power denominator. So
$d'L = \prod_p p^{s_p}$ with $s_p \in \mathbb{Z}$ and $s_p$ positive for $p$ odd ($s_2$ may be negative).

Let $E = \mathbb{Q}(\sqrt{m})$ and let the ideal class number of $E$ be $h_E = h_2(E) \cdot h_0(E)$
where $h_2(E)$ is a power of 2 and $h_0(E)$ is odd. We seek to determine the best
possible bound $B$ so that the conditions

(i) $s_p < n(n - 1)/2$, $n = \dim L$,

(ii) $s_2 < B$

will imply that $g^+(L) \leq h_2(E)$ (or that $g^+(L) \leq 2h_2(E)$ when $L$ is definite, $m > 0$
with $N_{E/\mathbb{Q}}(\varepsilon) = -1$). In particular, when $h_E$ is odd, we obtain $g^+(L) = 1$ (or
$g^+(L) \leq 2$, respectively). The bound $B$ varies depending upon the field $E$. Of
course, if 2 is unramified in $E$, then the bound $B$ is exactly as determined in
[5]. So we restrict our interest to the cases $m \equiv 2, 3 \pmod 4$. The proofs of
the following results are consequences of the results of Section 2 and the index
computations in Earnest [3].

Proposition 3.7 (i) If m = 2 (mod 4), the best possible value of & is 

when m < 0 or m > 0 with N eiq (e) = + 1 (fi denotes the fundamental unit of 
£). 

(ii) If m = 2 (mod 4), m > 0 with N E/Q (s) = — 1 and the lattice L is 
indefinite , then the best possible bound $ is 2n(n — 1). 

[Examples showing the given bounds are best possible are 
L = _L|i"o 2>/2 <2 2i M(l, 2) in case (i) and L = <-7>l(J_f= 1 1 <2 4l » in case 

(ii)] 

Remark 3.8 In case m = 3 (mod 4), it is not as easy to determine the 
best possible value of If m < 0, the result is as follows: 

(i) For m = — 1, best possible value is B" = n(n — 1). 

(ii) For — 1, best possible value is B' = n(n — 1) — [n/2J 

For m > 0, the best possible bound may be either B\ B'\ \B'\ or 2 B" and the 
precise value can be determined by checking whether ±2e N e/q (E), 
whether the trace Tv E/Q (e) = 0 (mod 4) and by evaluating the symbol (e,/) p 
where p 12 and 2 = n 2 f in E p . 

Remark 3.9 Precise values of $g^+(L)$ are determined for unimodular lattices
$L$ in [3]. Also obtained as a consequence of the index computations
there is a crude upper bound for the values of $g^+(L)$ for an arbitrary $L$.

REFERENCES 

[1] N. C. Ankeny and J. S. Hsia, Unimodular round forms over dyadic local fields, J. Number 
Theory 6 (1974), 443-447. 

[2] A. G. Earnest, Spinor norms and spinor genera of integral quadratic forms under field 
extensions, Ph.D. Dissertation, Ohio State Univ., Columbus, Ohio, 1975. 

[3] A. G. Earnest, Spinor genera of unimodular Z-lattices in quadratic fields, Proc. Amer. 
Math. Soc., to appear. 

[4] A. G. Earnest and J. S. Hsia, Springer-type theorems for spinor genera of quadratic forms, 
Bull. Amer. Math. Soc. 81 (1975), 942-943. 

[5] A. G. Earnest and J. S. Hsia, Spinor norms of local integral rotations, II, Pacific J. Math. 
61 (1975). 

[6] A. G. Earnest and J. S. Hsia, Spinor genera under field extensions, I, Acta Arith., to 
appear. 

[7] A. G. Earnest and J. S. Hsia, Spinor genera under field extensions, II: 2 unramified in the 
bottom field, Amer. J. Math., to appear. 

[8] M. Eichler, “Quadratische Formen und orthogonale Gruppen.” Springer-Verlag, 
Berlin-Göttingen-Heidelberg, 1952. 

[9] J. S. Hsia, Spinor norms of local integral rotations, I, Pacific J. Math. 57 (1975), 199-206. 

[10] G. Janusz, “Algebraic Number Fields.” Academic Press, New York, 1973. 

[11] M. Kneser, Klassenzahlen indefiniter quadratischer Formen in drei oder mehr 
Veränderlichen, Arch. Math. (Basel) 7 (1956), 323-332. 

[12] O. T. O’Meara, “Introduction to Quadratic Forms.” Springer-Verlag, 
Berlin-Göttingen-Heidelberg, 1963. 

[13] O. T. O’Meara and B. Pollak, Generation of local integral orthogonal groups, I, Math. 
Z. 87 (1965), 385-400. 

[14] O. T. O’Meara and B. Pollak, Generation of local integral orthogonal groups, II, Math. 
Z. 93 (1966), 171-188. 


AMS (MOS) 1970 subject classifications: 10C05, 15A63. 



On Products of Consecutive Integers 

P. ERDŐS E. G. STRAUS† 

HUNGARIAN ACADEMY OF SCIENCE UNIVERSITY OF CALIFORNIA 

BUDAPEST, HUNGARY LOS ANGELES, CALIFORNIA 


1. Introduction 

We define $A(n, k) = (n + k)!/n!$ where $n, k$ are positive integers and we
wish to examine divisibility properties of $A(n, k)$.

First observe that the cases $k = 1, 2$ are special. The relations

$$tA(n, 1) = A(m, 1) \quad \text{and} \quad tA(n, 2) = A(m, 2)$$

each have infinitely many solutions $n, m$: the first for every positive integer
$t$; the second for every positive integer that is not a square, as can be seen
from Pell’s equation

$$(2m + 3)^2 - t(2n + 3)^2 = 1 - t.$$
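
The case $k = 2$ is easy to explore numerically. The sketch below is an illustration only (the function names are hypothetical): it uses the definition $A(n, 2) = (n + 1)(n + 2)$ directly, so that $(m + 1)(m + 2) = T$ holds exactly when $4T + 1$ is the square of $2m + 3$, and lists the solutions of $tA(n, 2) = A(m, 2)$ for small $n$.

```python
from math import isqrt


def A(n, k):
    """A(n, k) = (n + k)! / n! = (n + 1)(n + 2)...(n + k)."""
    out = 1
    for i in range(1, k + 1):
        out *= n + i
    return out


def solutions_k2(t, n_max):
    """All (n, m) with 1 <= n <= n_max, m >= 1 and t * A(n, 2) == A(m, 2)."""
    found = []
    for n in range(1, n_max + 1):
        target = t * A(n, 2)
        # (m + 1)(m + 2) = target  <=>  (2m + 3)^2 = 4 * target + 1
        s = isqrt(4 * target + 1)
        if s * s == 4 * target + 1 and (s - 3) // 2 >= 1:
            found.append((n, (s - 3) // 2))
    return found


print(solutions_k2(2, 100))   # non-square t: solutions keep appearing
print(solutions_k2(4, 200))   # square t: no solutions in this range
```

For $t = 2$ the solutions found this way grow geometrically, as expected from a Pell-type recurrence.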

On the other hand, it is well known from the Thue-Siegel theorem that
for given $k \geq 3$ and fixed $t > 1$ the equation

$$tA(n, k) = A(m, k) \tag{1.1}$$

has only a finite number of solutions in integers $n, m$.

† Research of the second author was supported in part by Grant MCS 76-06988.

It is possible that (1.1) has only a finite number of solutions for fixed $t$
and variable $k > 2$, but we cannot prove this even in the case $t = 2$ without
additional hypotheses on $n$. Perhaps the following stronger conjecture
holds: $A(n, k) = tA(m, l)$ has only a finite number of solutions $m > n + k$ for
every rational $t$ if $\min(k, l) > 1$, $\max(k, l) > 2$.

Let $f(n, k)$ be the least positive integer $m > n$ so that

$$A(m, k) \equiv 0 \pmod{A(n, k)}. \tag{1.2}$$

We obviously have $f(n, k) < A(n, k) - k$ and it is easy to see that for $k > 1$
we get several residue classes (in addition to $-k$) (mod $A(n, k)$) for $m$ which
ensure that (1.2) holds. The number of these residue classes is always larger
than $k$, in fact exponential in $k$, so that the above inequality on $f(n, k)$ can
always be improved.

The algebraic identities

$$x(x + 1)(x + 2) = (x^2 + 2x)(x + 1),$$

which divides $(x^2 + 2x)(x^2 + 2x + 1)$;

$$x(x + 1)(x + 2)(x + 3) = (x^2 + 3x)(x^2 + 3x + 2);$$

$$x(x + 1)(x + 2)(x + 3)(x + 4) = (x^2 + 4x)(x^2 + 4x + 3)(x + 2),$$

which divides $(x^2 + 4x)(x^2 + 4x + 3)(x^2 + 4x + 4)$,

show that $A(n, 3)$ divides $A(n^2 + 4n + 1, 3)$; $A(n, 4)$ divides $A(n^2 + 5n +
2, 4)$; and $A(n, 5)$ divides $A(n^2 + 6n + 4, 5)$. Thus $f(n, 3) \leq n^2 + 4n + 1$,
$f(n, 4) \leq n^2 + 5n + 2$, $f(n, 5) \leq n^2 + 6n + 4$. It seems likely that these
bounds are attained for infinitely many, perhaps for almost all, values of $n$.
One might ask whether we get $f(n, k) < n^{k - \delta}$ for all (almost all) large $n$ and
some $\delta > 0$ when $k > 5$. In the other direction we would like to know
whether $f(n, k) > n^{1 + \delta}$ for infinitely many (almost all) $n$ for some $\delta > 0$
when $k > 1$. For $k = 2$, we have infinitely many $n$ with $f(n, k) \sim \sqrt{2}\,n$. Are
there infinitely many $n$ with $f(n, k) < cn$ for fixed $c$, $k > 2$?
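
The three divisibility statements derived from the identities can be spot-checked numerically. This is a minimal sketch, not a proof; it assumes only the definition $A(n, k) = (n + 1)\cdots(n + k)$, and the dictionary of bounds simply restates the three polynomials from the text.

```python
def A(n, k):
    """A(n, k) = (n + k)! / n! = (n + 1)(n + 2)...(n + k)."""
    out = 1
    for i in range(1, k + 1):
        out *= n + i
    return out


# Upper bounds for f(n, k) read off from the identities, k = 3, 4, 5.
bounds = {
    3: lambda n: n * n + 4 * n + 1,
    4: lambda n: n * n + 5 * n + 2,
    5: lambda n: n * n + 6 * n + 4,
}

ok = all(
    A(bound(n), k) % A(n, k) == 0
    for k, bound in bounds.items()
    for n in range(1, 300)
)
print(ok)
```

The check passes for every $n$ because, with $x = n + 1$, the quadratic factors on the right of each identity are among the $k$ consecutive integers whose product is $A(m, k)$.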

A function closely related to $f(n, k)$ is $g(n, k)$, the minimal integral value
$A(m, k)/A(n, k)$, $m > n$. The above discussion shows that $g(n, k) \sim
f(n, k)^k/A(n, k)$ for fixed $k$ and thus $g(n, k) \lesssim n^k$ for $k \leq 5$. Table 1 gives
values of $f(n, k)$, $g(n, k)$ for small $n$ and $k$.
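
Both functions can be computed by brute force straight from their definitions. The following is a hedged sketch rather than the authors' method of tabulation: `f` walks $m$ upward from $n + 1$ until (1.2) holds, and `g` uses the fact that $A(m, k)/A(n, k)$ increases with $m$, so the minimal integral value occurs at $m = f(n, k)$.

```python
def A(n, k):
    """A(n, k) = (n + k)! / n! = (n + 1)(n + 2)...(n + k)."""
    out = 1
    for i in range(1, k + 1):
        out *= n + i
    return out


def f(n, k):
    """Least m > n with A(m, k) divisible by A(n, k)."""
    a = A(n, k)
    m = n + 1
    while A(m, k) % a:
        m += 1
    return m


def g(n, k):
    """Minimal integral value of A(m, k) / A(n, k) over m > n."""
    return A(f(n, k), k) // A(n, k)


for n in range(1, 5):
    print(n, [(f(n, k), g(n, k)) for k in range(1, 4)])
```

The search always terminates, since $m = A(n, k) - k$ (or that value plus $A(n, k)$ when it does not exceed $n$) satisfies (1.2).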

One may try to estimate the density $d(n, k)$ of integers $m$ for which (1.2)
holds. Obviously $d(n, 1) = 1/(n + 1)$. For $k = 2$ we have

$$d(n, 2) = 2^{\omega(A(n, 2))}/A(n, 2), \tag{1.3}$$

where $\omega(x)$ denotes the number of distinct prime factors of $x$. Relation (1.3)
follows from the fact that $A(m, 2)$ is divisible by $A(n, 2)$ if and only if we can
TABLE 1

Values of f(n, k) and g(n, k)

     k=1      k=2       k=3        k=4          k=5           k=6           k=7
 n   f g    f   g    f    g    f     g    f       g    f        g    f        g
 1   3 2    2   2    3    5    2     3    4      21    2        4    3       15
 2   5 2    7   6    3    2    6    14    4       6    3        3    5       22
 3   7 2   14  12    7    6    4     2   11      78    6       11   13      646
 4   9 2    8   3   12   13    6     3   13      68   22     1794   20     2691
 5  11 2   13   5   13   10   52  2915   13      34    6        2   17      437
 6  13 2   47  42   25   39   52  1749   17      57   16      969    7        2
 7  15 2   62  56   53  231   52  1113   31     476   50    18921   21      345
 8  17 2   34  14   42   86   32   119   50    2703   50    10812   20      138
 9  19 2   43  18   19    7   51   477   51    1908   20       46   59    68076
10  21 2   31   8   63  160   62   720   51    1272   60    11346   49    11925
11  23 2   38  10   25    9   24    15   60    1891   46     1645   62    33902
12  25 2   76  33   62   96   61   372   47     420   50     1749   50     5247
13  27 2   19   2   47   35   31    22   31   42009  151   695981  169 11865205
14  29 2   79  27  117  413  268 72899  131   30954  284 20233213  149  3346915
15  31 2  254 240  269 4065  302 92415  319 1860516  284 14452295  169  5393275


factor A(n, 2) into two relatively prime divisors A(n, 2) = d_1 d_2, (d_1, d_2) = 1, 
and require 

m ≡ −1 (mod d_1),    m ≡ −2 (mod d_2). 

Since there are 2^{ω(A(n, 2))} such factorizations of A(n, 2), we get that many 
residue classes (mod A(n, 2)) for m. 
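
The count of residue classes can be checked numerically for small n. The sketch below assumes A(n, k) = (n + 1)···(n + k); the helper names `omega` and `residue_classes` are hypothetical. It counts the residues m modulo A(n, k) for which A(n, k) divides A(m, k) and, for k = 2, compares the count with 2^{ω(A(n, 2))}.

```python
from math import prod


def A(n, k):
    """Product (n+1)...(n+k); assumed meaning of A(n, k)."""
    return prod(range(n + 1, n + k + 1))


def omega(x):
    """Number of distinct prime factors of x, by trial division."""
    count, p = 0, 2
    while p * p <= x:
        if x % p == 0:
            count += 1
            while x % p == 0:
                x //= p
        p += 1
    return count + (1 if x > 1 else 0)


def residue_classes(n, k):
    """Number of residues m mod A(n, k) with A(n, k) dividing A(m, k)."""
    modulus = A(n, k)
    return sum(1 for m in range(modulus) if A(m, k) % modulus == 0)


if __name__ == "__main__":
    for n in range(1, 8):
        print(n, residue_classes(n, 2), 2 ** omega(A(n, 2)))
```

Dividing the count by A(n, k) gives the density d(n, k); for k = 2 the two printed columns should agree.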

For k > 2, the problem of computing d(n, k) becomes messier, but not 
intrinsically difficult. The number of residue classes (mod A(n, k)) to which 
m must belong remains O(n^ε) for every ε > 0 and hence 

1/A(n, k) ≤ d(n, k) = O(n^ε/A(n, k)) 

for all values of k. 

Another question is that of determining those m > n so that there exists 
some k for which (1.2) holds. Since for k > m − n, we have 

A(m, k)/A(n, k) = A(n + k, m − n)/A(n, m − n), 

which is certainly an integer for k = A(n, m − n) − m, the problem becomes 
trivial unless we restrict the values of k to k < m − n. 
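
The cancellation identity and the trivial choice of k can be verified directly. This is a small sketch under the assumption A(n, k) = (n + 1)···(n + k); the names `identity_holds` and `trivial_k` are illustrative only.

```python
from fractions import Fraction
from math import prod


def A(n, k):
    """Product (n+1)...(n+k); assumed meaning of A(n, k)."""
    return prod(range(n + 1, n + k + 1))


def identity_holds(n, m, k):
    """Check A(m,k)/A(n,k) == A(n+k, m-n)/A(n, m-n), valid for k >= m - n."""
    left = Fraction(A(m, k), A(n, k))
    right = Fraction(A(n + k, m - n), A(n, m - n))
    return left == right


def trivial_k(n, m):
    """The value k = A(n, m-n) - m, for which m + k equals A(n, m-n)."""
    return A(n, m - n) - m


if __name__ == "__main__":
    n, m = 2, 4
    k = trivial_k(n, m)
    print(k, identity_holds(n, m, k), A(m, k) % A(n, k) == 0)
```

With this k the last factor of A(n + k, m − n) is m + k = A(n, m − n), which is why the quotient is an integer.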

For n = 1 and any m we see that 

A(m, p − 1)/A(1, p − 1) = (1/p) \binom{m + p − 1}{p − 1}, 

where the binomial coefficient is divisible by p unless m ≡ 0 (mod p). Since 
m cannot be divisible by all primes < m except when m = 2, we see that for 
every m > 2 there exists a k, 1 ≤ k ≤ m − 1, so that A(1, k) divides A(m, k). 
The question whether there exists a k, 1 ≤ k ≤ m − 2, so that A(2, k) divides 
A(m, k), which is equivalent to \binom{k + 2}{2} divides \binom{m + k}{k}, 
seems much more difficult to decide. The general problem can be stated as 
follows: 

Given n > 1 is it true that for all (almost all) large m there exists a k, 
1 ≤ k ≤ m − n so that 

A(m, k) ≡ 0 (mod A(n, k))?    (1.4) 

If not, what is the density d*(n) of integers m for which (1.4) has a solution 
with 1 ≤ k ≤ m − n? 

In Section 2 we show that for bounded ratios m/n and any δ > 0 we get 
only a finite number of solutions of (1.2) with k > δn. 

In Section 3 we treat the special case in which the set {n + 1, ..., n + k} 
contains a prime and m/n is bounded to show that (1.2) has only a finite 
number of solutions 2 ≤ k < m − n in that case. We give an example which 
may prove the only one with m ≤ 2n. Finally, we mention some additional 
problems and conjectures. 


2. The Case k > δn 

In this section we prove the following. 

Theorem 2.1 Given positive numbers δ, A so that k > δn and n + k < 
m ≤ An, then there exists an n_0 = n_0(δ, A) so that the congruence (1.2) has no 
solution with n > n_0. 

The proof depends on showing that for all large n there exists a prime p 
in the interval [n + 1, n + k] that divides A(n, k) to a higher power than it 
divides A(m, k). 

Lemma 2.2 Assume that the hypotheses of Theorem 2.1 are satisfied and 
that every prime p ∈ [n + 1, n + k] divides A(m, k). Then for every ε > 0 there 
exists an n_1 = n_1(ε) so that for all n > n_1 there exists an integer l, 2 ≤ l ≤ A, 
with 

(l − ε)n + (l − 1)k < m < lk + εn    (2.1) 

and hence 

(l − 2ε)n < k.    (2.2) 

Note that it is possible to have every prime that divides A(n, k) also 
divide A(m, k). This always happens when k = ln, m = l^2 n. 
Proof Let n be so large that there exists a prime in [n + 1, n + k] and let 
p be the largest prime in that interval. Let l be the largest integer so that 
lp ≤ m + k. Then l ≥ 2 and for large n we have p > n + k − δn. Hence 

l ≤ (m + k)/p < (m + k)/(n + k − δn) < 1 + (m − n)/((1 + δ − δ)n) < 1 + A − 1 = A. 

Now pick n so large that p > n + k − εn/A. Then 

m + k ≥ lp > ln + lk − εn 

and hence 

(l − ε)n + (l − 1)k < m. 

Let q be the smallest prime so that m + k < (l + 1)q. For large n we must 
have n < q since otherwise l times every prime in [n + 1, n + k] would lie in 
[m + 1, m + k] which is impossible. Let n be so large that 

(m + k)/(l + 1) < q < (m + k)/(l + 1) + εn/(l(l + 1)); 

then m < lq and hence 

(l + 1)m < l(m + k) + εn 

or 

m < lk + εn. 


Lemma 2.3 Assume that the hypotheses of Theorem 2.1 are satisfied and 
that s is an integer such that n < k/s and every prime p ∈ [k/t, (n + k)/t], t = 1, 
2, ..., s, satisfies 

A(m, k) ≡ 0 (mod p^t). 

Assume further that for the integer l in Lemma 2.2 we have 

m + (a/b)k − εn < l(n + k) < m + (c/d)k + εn    (2.3) 

where a/b and c/d are consecutive terms in the Farey series of order s and ε is a 
given number, 0 < ε < s^{−2}. 

Then there exists an n_2 = n_2(ε) so that for all n > n_2 we have 

(l − ε)n + (l − c/d)k < m < (l − a/b)k + εn    (2.4) 

and hence 

k > bd(l − 2ε)n ≥ s(l − 2ε)n.    (2.5) 
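
The step from (2.4) to (2.5) uses two standard facts about consecutive Farey fractions a/b < c/d of order s: c/d − a/b = 1/(bd), and bd ≥ s. The sketch below generates the Farey series with the usual next-term recurrence so that both facts can be checked for small s; the function name `farey` is illustrative.

```python
def farey(order):
    """Fractions a/b in [0, 1] with b <= order, in increasing order, as (a, b) pairs."""
    a, b, c, d = 0, 1, 1, order
    terms = [(a, b)]
    while c <= order:
        terms.append((c, d))
        j = (order + b) // d
        # Standard recurrence for the term following a/b, c/d.
        a, b, c, d = c, d, j * c - a, j * d - b
    return terms


def neighbours(order):
    """Pairs of consecutive terms ((a, b), (c, d)) of the Farey series."""
    seq = farey(order)
    return list(zip(seq, seq[1:]))


if __name__ == "__main__":
    for (a, b), (c, d) in neighbours(5):
        print(f"{a}/{b} < {c}/{d}: bc - ad = {b * c - a * d}, bd = {b * d}")
```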
Proof For s = 1, this is Lemma 2.2. We now proceed by induction on s. 
If max{b, d} < s, then the result follows from the induction hypothesis. 

If d = s > 1, then b < s and the first inequality in (2.4) follows directly 
from (2.3) while the second inequality holds by the induction hypothesis. 

Now assume that b = s, d < s. Then the first inequality in (2.4) is still an 
immediate consequence of (2.3). If the second inequality were false, we 
would have 

sl (k/s) ≤ m + (a/s)k − εn < sl (n + k)/s. 

If l(n + k) < m + (a/s)k + εn and n is sufficiently large, then there exists a 
prime p so that 

m + (a/s)k − 2εn < slp < m + (a/s)k + εn 

and 

(n + k)/s − εn/(sl) < p < (n + k)/s. 

Hence (sl − a)p < m and (sl + s − a)p > m + k and A(m, k) ≢ 0 (mod p^s), 
contrary to hypothesis. 

We may therefore assume that l(n + k) ≥ m + (a/s)k + εn; then for large 
n there exists a prime p with 

m + (a/s)k < slp < m + (a/s)k + εn/(sl)    and    k/s + εn/(sl) < p < (n + k)/s. 

Thus, again, (sl − a)p < m while (sl + s − a)p > m + k and A(m, k) ≢ 
0 (mod p^s), contrary to hypothesis. 

Proof of Theorem 2.1 If l > 2 and every prime p ∈ [k/2, (n + k)/2] 
satisfies A(m, k) ≡ 0 (mod p^2), then, according to (2.5), we have k > 3n. Now 
let s be the largest integer for which sn < k. If every prime p ∈ [k/t, (n + k)/t], 
t = 1, 2, ..., s satisfies A(m, k) ≡ 0 (mod p^t), then, according to (2.5), we have 
k > s(2 − 2ε)n > (s + 1)n, a contradiction. 

Now if l = 2 and 2n < k, then Lemma 2.3 can be applied as before. If 
l = 2 and 2n > k, then every prime p ∈ [n + 1, (n + k)/2] satisfies A(n, k) ≡ 
0 (mod p^2). But, according to Lemma 2.2, we have 

(4 − 3ε)n < m < m + k < (6 + ε)n. 

Thus for large n there exists a prime p, 

(| - e)n < p < (| + e)n, 

so that for sufficiently small ε we have 3p < m while 5p > m + k and 
A(m, k) ≢ 0 (mod p^2). 
3. The Case A(m, k) ≡ 0 (mod A(n, k)); n + k < m ≤ An and 
{n + 1, ..., n + k} Contains a Prime 


We first mention the interesting example 

A(32, 6) = 37 · A(16, 6).    (3.1) 

Here two of the integers 17, 18, 19, 20, 21, 22 are primes and 16 may well be 
the largest n which solves our problem in case A = 2. In the following we 
show that we can find an effective bound for all solutions k, n, m. 
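
Example (3.1) is easy to confirm by machine, and the same loop can look for further solutions in a small range. The sketch assumes A(n, k) = (n + 1)···(n + k) and searches for k ≥ 2 and n + k < m ≤ 2n with a prime in [n + 1, n + k]; the names `is_prime` and `solutions` are hypothetical, and a finite search of this kind is evidence only, not a proof of uniqueness.

```python
from math import prod


def A(n, k):
    """Product (n+1)...(n+k); assumed meaning of A(n, k)."""
    return prod(range(n + 1, n + k + 1))


def is_prime(x):
    """Trial-division primality test, adequate for small x."""
    return x > 1 and all(x % d for d in range(2, int(x ** 0.5) + 1))


def solutions(max_n, bound=2):
    """Triples (n, k, m), n <= max_n, k >= 2, n + k < m <= bound*n,
    with a prime in [n+1, n+k] and A(n, k) dividing A(m, k)."""
    found = []
    for n in range(1, max_n + 1):
        for k in range(2, n + 1):
            if not any(is_prime(x) for x in range(n + 1, n + k + 1)):
                continue
            base = A(n, k)
            for m in range(n + k + 1, bound * n + 1):
                if A(m, k) % base == 0:
                    found.append((n, k, m))
    return found


if __name__ == "__main__":
    print(A(32, 6) == 37 * A(16, 6))
    print(solutions(20))
```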

From Theorem 2.1 we know that we can restrict attention to cases 
k < δn where δ is any fixed positive number. Since there exists a prime p with 

n + 1 ≤ p ≤ n + k < m ≤ An    (3.2) 

and A(m, k)/A(n, k) is an integer, there must exist an integer l so that 

m + 1 ≤ lp ≤ m + k.    (3.3) 

Thus 

ln + l − k ≤ m ≤ ln + (l − 1)k.    (3.4) 

Lemma 3.1 Every integer 

x ∈ [n + 1, n + k] \ [(m + 1)/l, (m + k)/l] 

has all prime divisors less than (l + 1)k. 

Proof Assume that x has a prime divisor q > (l + 1)k. Now, either 
lx < m + 1 and lx + q > ln + lk > m + k so that q does not divide A(m, k), 
or lx > m + k and lx − q < ln + lk − (l + 1)k < m and again q does not 
divide A(m, k). 

The set of integers in [n + 1, n + k] \ [(m + 1)/l, (m + k)/l] contains an 
interval of length > k(l − 1)/2l = kε. 


Lemma 3.2 There exists a k_0 so that A(n, [kε]) has prime divisors greater 
than (l + 1)k for all k > k_0, k < δn. 

Proof Set [kε] = t and consider the binomial coefficient 

\binom{n + t}{t} = A(n, t)/t!. 

Every prime power q^a that divides a binomial coefficient \binom{n + t}{t} satisfies 
q^a ≤ n + t. Thus the hypothesis that all prime divisors are < (l + 1)k yields 

\binom{n + t}{t} < (n + t)^{π((l + 1)k)} < (n + t)^{c(l + 1)k/log k}    (3.5) 

for a suitable constant c. On the other hand 

\binom{n + t}{t} ≥ ((n + t)/t)^t.    (3.6) 

Now set (n + t)/t = C > 1/δ and compare (3.5) and (3.6) to get 

C^t < C^{c(l + 1)k/log k} (εk)^{c(l + 1)k/log k}    (3.7) 

which is false for k > k_0 provided δ is small enough. 
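
The proof rests on the classical fact that a prime power q^a dividing \binom{N}{t} cannot exceed N. The following sketch checks this for small N by factoring the binomial coefficient directly; the function name `prime_power_parts` is illustrative.

```python
from math import comb


def prime_power_parts(N, t):
    """Exact prime-power divisors q**a of C(N, t), one per prime q."""
    value = comb(N, t)
    parts = []
    q = 2
    while q <= N and value > 1:
        if value % q == 0:
            # q must be prime here: smaller primes were already divided out.
            power = 1
            while value % q == 0:
                value //= q
                power *= q
            parts.append(power)
        q += 1
    return parts


if __name__ == "__main__":
    print(comb(10, 3), prime_power_parts(10, 3))
```

Taking N = n + t gives the bound q^a ≤ n + t used above.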

Theorem 3.3 For each A > 1 there exists only a finite number of integers 
k, n, m such that k > 1, n + k < m ≤ An and A(m, k) ≡ 0 (mod A(n, k)) where 
the interval [n + 1, n + k] contains a prime. 

Proof We first pick δ in Theorem 2.1 sufficiently small and then can 
restrict attention to a fixed integer l, 2 ≤ l ≤ A + δ. By Lemma 3.2 we have 
k < k_0. Now pick one of the integers x ∈ [n + 1, n + k] so that 
lx ∉ [m + 1, m + k]. Then, by the same argument that we used in the proof 
of Lemma 3.1 we have (x, y) < (l + 1)k for every y ∈ [m + 1, m + k] and 
hence, if x | A(m, k) we must have n < x < ((l + 1)k)^k < ((l + 1)k_0)^{k_0}. 

We have not carried out the detailed estimates needed to show, for 
example, that the example stated at the beginning of this section is the 
unique solution for A = 2, except for A(4, 2) and A(8, 2), but it would not be 
difficult to do so. 


4. Open Questions 

4.1. In view of Lemma 2.2, it would be interesting to know the smallest 
m > 2k so that every prime in the interval [k + 1, 2k] divides A(m, k). In 
particular, is it true that m > k^c for every c? 

4.2. We know of no example with n > 16, k > 2, where A(n, k) divides 
A(m, k) and n + k < m ≤ 2n. It would be interesting to find a bound for such 
n without the hypothesis that there exists a prime in the interval 
[n + 1, n + k]. 

4.3. A question related to those discussed in this paper is to find solutions 
for A(n, k) | A(n + k, n + 2k). Charles Grinstead has found the following 
examples: 


n:     2     3     4     5     6     7     8     9 
k:     4     3   206  1886  3472  3471  8170  8169 

Introduction 

The Gauss sums for multiplicative characters of finite fields, which play 
such an important role in classical number theory, appear again in the 
functional equation of Abelian L-functions, as local Gauss sums τ_p(χ) for 
prime ideals p which are tame at χ, i.e., divide the conductor to at most the 
first power. Going over to Artin L-functions, one gets analogously tame 
local Galois Gauss sums for arbitrary characters of Galois groups, derived 
from the local constants of Langlands. For details see, e.g., Martinet [4]. 
For tame Abelian local Gauss sums, the quotient 

j_p(χ, φ) = τ_p(χ) τ_p(φ) τ_p(χφ)^{-1}, 

in the nontrivial case when p is actually ramified at χ, φ, and χφ, is a Jacobi 
sum, hence an algebraic integer. We shall be concerned with the corresponding 
“non-Abelian Jacobi sum” 

j_p(χ, φ) = τ_p(χ)^{deg(φ)} τ_p(φ)^{deg(χ)} τ_p(χφ)^{-1},    (0.1) 

deg(χ) denoting the degree of χ, always under the hypothesis that p be tame 
at χ and φ. From the connection between Galois Gauss sums and Galois 
module structure of algebraic integers (cf. Fröhlich [1]) and using the theory 
of module resolvents (cf. Fröhlich [2]), we shall derive an interpretation of 
the fractional ideals generated by the j_p(χ, φ) which then implies that they 
are integral. This interpretation is new and of interest even in the classical 
absolutely Abelian case. Our method will also yield, without further effort, a 
similar interpretation for the quotients of local conductors, which, however, 
lies less deep. 
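
For orientation, the classical finite-field relation behind the term “Jacobi sum” can be checked numerically. The sketch below works only with characters of the multiplicative group of Z/pZ and the textbook normalisation of Gauss sums; it is an illustration of the classical identity J(χ, φ) = g(χ)g(φ)/g(χφ) for nontrivial χ, φ, χφ, and does not implement the local Galois Gauss sums of this paper, whose normalisation may differ. All function names are hypothetical.

```python
import cmath


def make_character(p, root, power):
    """Character of (Z/p)* sending the primitive root `root` to
    exp(2*pi*i*power/(p-1)); extended by 0 at multiples of p."""
    log = {}
    x = 1
    for j in range(p - 1):
        log[x] = j
        x = x * root % p

    def chi(a):
        a %= p
        if a == 0:
            return 0
        return cmath.exp(2j * cmath.pi * power * log[a] / (p - 1))

    return chi


def gauss_sum(chi, p):
    """Classical Gauss sum: sum of chi(a) * exp(2*pi*i*a/p) over a mod p."""
    return sum(chi(a) * cmath.exp(2j * cmath.pi * a / p) for a in range(1, p))


def jacobi_sum(chi, phi, p):
    """Classical Jacobi sum: sum of chi(a) * phi(1 - a) over a mod p."""
    return sum(chi(a) * phi(1 - a) for a in range(p))


if __name__ == "__main__":
    p, root = 7, 3  # 3 is a primitive root modulo 7
    chi = make_character(p, root, 1)
    phi = make_character(p, root, 2)
    chi_phi = make_character(p, root, 3)
    quotient = gauss_sum(chi, p) * gauss_sum(phi, p) / gauss_sum(chi_phi, p)
    print(quotient, jacobi_sum(chi, phi, p))
```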


1. Statement of Results 

Let Q̄ be the algebraic closure of Q in the field of complex numbers. A 
number field K is always a subfield of Q̄ of finite degree over Q, and we write 
Gal(Q̄/K) = Ω_K. Throughout N/K is a normal extension of number fields 
with Galois group Γ and p is a finite prime divisor of K which, except for 
Theorem 1, will be assumed to be tame (i.e., at most tamely ramified) in N. If 
χ, φ are characters of Γ, then j_p(χ, φ) is defined as in (0.1), and k(χ, φ) is the 
field obtained by adjoining to the number field k the values of χ and φ on Γ. 

Theorem 1 For ω ∈ Ω_Q, 

j_p(χ^ω, φ^ω) = j_p(χ, φ)^ω. 

In particular j_p(χ, φ) ∈ Q(χ, φ). 

This theorem can be proved quickly. Observe that j_p(χ, φ) is the local 
Galois Gauss sum τ_p(θ) for the virtual character θ = deg(φ)χ + 
deg(χ)φ − χφ, and that det θ = 1 on Γ. Then apply the formula for Galois 
action in Martinet [4, II, Theorem 5.1]. 

Denote by R(Γ) the group ring of Γ over a ring R. Let E be a number 
field containing K, and let V, W be E(Γ)-modules of finite dimension. Γ will 
also act on the algebra N ⊗_K E via the first factor. If f: V → N ⊗_K E, 
g: W → N ⊗_K E are homomorphisms of E(Γ)-modules so is the product fg 
given by 

V ⊗_E W → (N ⊗_K E) ⊗_E (N ⊗_K E) → N ⊗_K E, 

the second map being multiplication. We get a homomorphism 

ρ: H_V ⊗_E H_W → H_{V⊗W}    (1.1) 

with ρ(f ⊗ g) = fg, where we abbreviate 

H_V = Hom_{E(Γ)}(V, N ⊗_K E). 


Theorem 2 ρ is an isomorphism. 
Denote completion at p by subscript. Let o, O, õ be the rings of algebraic 
integers of K, N, and E, respectively. Let M, N be lattices over õ_p(Γ), spanning 
V_p and W_p, respectively. Abbreviate again 

H_M = Hom_{õ_p(Γ)}(M, (O ⊗_o õ)_p). 

Thus H_M spans Hom_{E_p(Γ)}(V_p, (N ⊗_K E)_p) which we identify with H_{V,p}. 
We again get a homomorphism 

λ_p: H_M ⊗ H_N → H_{M⊗N}    (⊗ = ⊗_{õ_p}),    (1.2) 

with λ_p(f ⊗ g) = fg. By Theorem 2 this is injective and has finite cokernel. 
Moreover cok λ_p is an õ_p-module, hence an õ-module. Denote its order over 
õ by 𝔗(M, N). This is an ideal of õ, where we now take E = K(χ, φ), with χ 
and φ the characters of the representations of Γ afforded by V and by W, 
respectively. Write moreover t(M, N) for the order of cok λ_p, viewed as a 
module over the ring Z(χ, φ) of algebraic integers in Q(χ, φ). Then 

t(M, N) = N_{K(χ,φ)/Q(χ,φ)} 𝔗(M, N),    (1.3) 

(norm). Also if ω ∈ Ω_K and M^ω is the module obtained from M by semilinear 
action of ω then 

t(M^{ω^{-1}}, N^{ω^{-1}})^ω = t(M, N).    (1.4) 

Let now k = K ∩ Q(χ, φ) and let {σ} be a right transversal of Ω_k in Ω_Q. 

Theorem 3 

t/pOc, 0))=n t(M--\ 

<7 

Corollary j_p(χ, φ) is an algebraic integer. 

Next let M* denote the contragredient õ_p(Γ)-module to M, i.e., 
M* = Hom_{õ_p}(M, õ_p). Let f_p(χ) be the local conductor. 

Theorem 4 

f_p(χ)^{deg(φ)} f_p(φ)^{deg(χ)} f_p(χφ)^{-1} = 𝔗(M, N)𝔗(M*, N*). 

The corresponding corollary is known, even in the wild case (cf. Martinet 
[4, Exercise 2]). 

2. The Evaluation Pairing 

The notation is always that introduced in Section 1. We shall write 
π_V: H_V ⊗_E V → N ⊗_K E,    π_V(f ⊗ v) = f(v) 
for the evaluation pairing. Let {f_i}, {v_j}, {g_k}, {w_l} be E-bases of H_V, V, H_W, 
and W, respectively. We know that the E-dimensions of H_V and of V coincide 
and that det(f_i(v_j)) is an invertible element of N ⊗_K E. Also 

det((f_i g_k)(v_j ⊗ w_l)) = (det(f_i(v_j)))^{deg(φ)} (det(g_k(w_l)))^{deg(χ)},    (2.1) 

where the row index on the left is (i, k), the column index is (j, l). Thus the 
determinant in (2.1) is invertible and this implies that ρ is injective. Counting 
dimensions we conclude that ρ is bijective. This yields Theorem 2. 

The above applies equally if we complete at p. Let now {f_i}, {v_j}, {g_k}, {w_l} 
be õ_p-bases of H_M, M, H_N, and N, respectively. Define 

s(M) = s*(M, (O ⊗_o õ)_p) = õ_p det(f_i(v_j))    (2.2) 

as in Fröhlich [2, §3]. This indeed only depends on M, not on the choice 
of bases. By (2.1) 

s(M)^{deg(φ)} s(N)^{deg(χ)} = õ_p det((f_i g_k)(v_j ⊗ w_l)).    (2.3) 

The multiplication of the õ_p-modules on the left of (2.3) is induced by that in 
N ⊗_K E, see, e.g., Fröhlich [2, A.2]. On the other hand, let {h_r} be an õ_p-basis 
of H_{M⊗N} (⊗ = ⊗_{õ_p}). Then via (2.3) we get 

s(M)^{deg(φ)} s(N)^{deg(χ)} = s(M ⊗ N) 𝔗(M, N)_p,    (2.4) 

following the procedure to compute module indices in this situation given in 
Fröhlich [2, A.1]. Indeed one can see this easily. If (a_{(i,k),r}) is the matrix 
transforming the basis {h_r} of H_{V⊗W} into the basis {f_i g_k}, then on the one hand 

𝔗(M, N)_p = õ_p det(a_{(i,k),r}), 

while on the other hand 

det((f_i g_k)(v_j ⊗ w_l)) = det(a_{(i,k),r}) det(h_r(v_j ⊗ w_l)). 

Now use (2.1). 

Let g be the multiplication N ⊗_K E → NE ⊂ Q̄, NE the composite field. 
Apply g_p to (2.4) and observe that g acts as the identity on fractional ideals 
of õ_p. We get 

g_p(s(M))^{deg(φ)} g_p(s(N))^{deg(χ)} = g_p(s(M ⊗ N)) 𝔗(M, N)_p.    (2.5) 


3. Resolvents 

Let a o_p(Γ) = O_p. Let T be a representation of Γ over Q̄, corresponding 
to the character χ. In Q̄_p = Q̄ ⊗_K K_p we define the resolvent by 

(a | χ) = det(Σ_γ a^γ T(γ)^{-1}). 

This is an element of Q̄*_p, in fact of (NE)*_p. But (cf. Fröhlich [1, Theorem 21]) 

g_p(s(M)) = (a | χ) õ_p. 

Hence by (2.5) 

[(a | χ)^{deg(φ)} (a | φ)^{deg(χ)} (a | χφ)^{-1}] õ_p = 𝔗(M, N)_p.    (3.1) 

Indeed the expression in square brackets on the left lies in E*. View it as an 
idele. It then defines a global fractional ideal of õ, “localized” at primes 
above p. Similarly 𝔗(M, N) is an ideal of õ localized at these primes. From 
(3.1) we now get, in the obvious notation, the equation 

((a | χ)^{deg(φ)} (a | φ)^{deg(χ)} (a | χφ)^{-1}) = 𝔗(M, N)    (3.2) 

for ideals of õ. 

The proof of Theorem 3 now follows from (3.2) and Fröhlich [1, 
Theorem 4], using (1.3), and the observation that if {ω} is a right transversal 
of Ω_E in Ω_{Q(χ,φ)}, {σ} one of Ω_k in Ω_Q, then {ωσ} is one of Ω_K in Ω_Q. 

Similarly, the proof of Theorem 4 follows from (3.2) and Fröhlich [1, 
Theorem 18]. 
Let k be a finite abelian extension of the rationals Q. We take “the Hasse norm theorem holds for k” 
to mean that each nonzero rational number is the norm of an element of k if and only if it is the norm 
of an element from each completion of k. This paper gives computable necessary and sufficient 
conditions for the Hasse norm theorem to hold for k in the case where k satisfies the following 
properties: (i) [k : Q] is a power of l where l is any prime, (ii) k is the composite of cyclic extensions 
of Q that have pairwise relatively prime conductors, and (iii) the conductor of k is odd. 


Introduction 

In 1931 H. Hasse [7] gave an example, i.e., k = Q((−3)^{1/2}, 13^{1/2}), that 
showed that the Hasse norm theorem did not hold for arbitrary abelian 
extensions. In 1936 A. Scholz [12] gave conditions for the Hasse norm 
theorem to hold for certain noncyclic extensions of degree l^2 over the ground 
field where l is a prime. In 1967 J. Tate and J.-P. Serre [1, p. 360] gave a 
method for producing numbers that were local norms everywhere but not 
global norms from Q(13^{1/2}, 17^{1/2}). In 1971 C. Pitti [11] and now O. Taussky-
Todd [13] have given substitutes for the Hasse norm theorem for Klein 
4-group extensions. 
This paper continues the project started in Garbanati [4, 5] of classifying 
those abelian extensions of the rationals for which the Hasse norm theorem 
holds. 

1. Preliminaries 

We now state those results of Garbanati [5] that we shall use but not 
prove and we introduce some new notation. 

Let k be a finite abelian extension of the rationals Q with Galois group 
G(k/Q). 

If F is a field, let F* be the multiplicative group F — {0}. 

Let p be a prime divisor (finite or infinite) of Q and let β ∈ Q*. Let (β, k)_p 
be the Hasse norm residue symbol (cf. Garbanati [5] for a definition). Let N 
be the norm map from k to Q. By an abuse of language we shall take the 
phrase “the Hasse norm theorem holds for k” to mean that “for any β ∈ Q* 
there exists a b ∈ k such that Nb = β if and only if (β, k)_p = 1 for all prime 
divisors p of Q.” 

Let K' be the “narrow” genus field of k, i.e., the largest abelian extension 
of k that is unramified at all (finite) prime ideals of k and which is abelian 
over Q. 

Let K be the “narrow” central class field of k, i.e., the largest abelian 
extension of k that is unramified at all (finite) prime ideals of k and which is a 
Galois extension of Q such that G(K/k) is a subset of the center of G(K/Q). 

Let g^+ = [K' : k] and z^+ = [K : k]. 

Theorem 1 Let k be an arbitrary finite abelian extension of Q. Then the 
Hasse norm theorem holds for k if and only if z^+ = g^+. 

Proof [5, Theorem 1]. 

Theorem 2 Let k_0 and k be arbitrary finite abelian extensions of Q such 
that k_0 ⊃ k. If the Hasse norm theorem holds for k_0, then it holds for k. 

Proof [5, Theorem 2]. 

Lemma 1 Let k_0 and k be arbitrary finite abelian extensions of Q such 
that k_0 ⊃ k. Let K_0 be defined for k_0 as K was defined for k. Then K ⊂ K_0. 

Proof [5]. 

Let k* = K'. Let K* be defined for k* as K was defined for k. If G is a 
group, then let (G, G) be its commutator subgroup. 

Lemma 2 Let k be an arbitrary finite abelian extension of Q. Then 
G(K*/k*) = (G, G), where G = G(K*/Q). 

Proof [5, Lemma 14]. 

Let Q_m = Q(e^{2πi/m}) where m is a positive integer and let f be the conductor 
of k, i.e., the smallest positive integer such that k ⊂ Q_f. 
If G is a finite abelian group, let G' denote the group of characters of G. If 
n is a positive integer, let G_n denote the multiplicative group of integers 
mod n. Let G = G(Q_f/Q) and H = G(Q_f/k). Then G(k/Q) = G/H. Let 
ζ = e^{2πi/f}. If a ∈ Z (the integers) and a and f are relatively prime, let 
σ_a: ζ → ζ^a be a typical element of G. Let τ be the isomorphism of G onto G_f 
given via σ_a → a. Let H_f = τ(H). Let X = X_k be the group of all the characters 
in G'_f that take on the value 1 at each element of H_f. Let 

f = p_1^{e_1} ··· p_s^{e_s} 

be the prime power decomposition of f. Since 

G_f ≅ G_{n_1} × ··· × G_{n_s} 

where n_i = p_i^{e_i} (1 ≤ i ≤ s) we can identify the two groups above and write 

G'_f = G'_{n_1} × ··· × G'_{n_s}, 

thinking of the characters in G'_f as s-tuples of characters. For 1 ≤ i ≤ s 
consider the projection map 

pr_i: G'_f → G'_{n_i}. 

Let X_i = pr_i X. Let X̂ be the external direct product of the X_i for 1 ≤ i ≤ s, 
i.e., 

X̂ = dir ∏_{i=1}^{s} X_i. 

By [8, p. 357] X̂ = X_{K'}. 

Throughout the rest of this section we shall assume that: 

(i) [k : Q] is a power of l where l is any prime; 

(ii) the conductor f of k is odd. 

Let k_0 be the largest field such that k ⊂ k_0 ⊂ Q_f and [k_0 : Q] is a power of 
l. Let X^0 = X_{k_0}. Since p_i (1 ≤ i ≤ s) is odd, X_i^0 is cyclic. Let χ_i^0 be a generator 
of X_i^0. (We shall require that χ_i^0 be a certain type of generator of X_i^0 in 
Section 3.) Let o(·) denote “the order of.” Let α(i) = o(X_i^0). Let δ_i = e^{2πi/α(i)}. 
We define [p_i, p_j] (1 ≤ i, j ≤ s) by the conditions 

0 < [p_i, p_j] ≤ α(i),    [p_i, p_i] = α(i), 

and 

χ_i^0(p_j) = δ_i^{[p_i, p_j]}    if j ≠ i. 

Let I(𝔭_i | p_i) be the inertia group of p_i where 𝔭_i is some prime ideal in k_0 
dividing p_i. Let σ_{i k_0} ∈ I(𝔭_i | p_i). (We are using σ_i instead of (p*) which was 
used in [5].) 
Fig. 1 


Let K_0 be defined for k_0 as K was defined for k. By Lemma 1 we have 
Fig. 1. 

By Garbanati [5, Lemma 11] there exists σ_{i K_0} ∈ I(𝔓_i | p_i) (the inertia 
group of p_i where 𝔓_i is a prime ideal in K_0) such that σ_{i K_0} restricted to k_0 is 
σ_{i k_0}. 

Let σ_i be σ_{i K_0} restricted to K*. 

Let X̂ = X_{K'} ⊂ X^0 = X_{k_0}. Let β(i) = o(X_i). Let β(i, j) = min{β(i), β(j)}. 

If G is a group, let (g_1, g_2) be the commutator g_1^{-1} g_2 g_1 g_2^{-1} and let cen G 
be the center of G. 

Suppose k* = K' is noncyclic. Then for appropriately chosen σ_{i k_0} we 
have the following lemmas. (We shall explain how to choose σ_{i k_0} in Section 
3.) 

Lemma 3 Let G = G(K*/Q). Then G is generated by the σ_i (1 ≤ i ≤ s) 
and (G, G) ⊂ cen G. 

Proof [5, Lemma 13]. 

Lemma 4 Let W be the free group with \binom{s}{2} generators w_{ij} (1 ≤ i < j ≤ s). 
Let N be the normal subgroup generated by the relators 

(w_{ij}, w_{bc}),    1 ≤ i < j ≤ s, 1 ≤ b < c ≤ s    (1) 

w_{ij}^{β(i, j)},    1 ≤ i < j ≤ s    (2) 

∏_{j<i} w_{ji}^{[p_j, p_i]} ∏_{i<j} w_{ij}^{−[p_j, p_i]},    1 ≤ i ≤ s.    (3) 

(Note that i is a fixed number between 1 and s in the last type of relator.) 
Then 

W/N ≅ G(K*/k*) 

via 

w_{ij}N → (σ_i, σ_j). 

Proof [5, Lemma 15]. 


2. A Criterion for the Hasse Norm Theorem to Hold 

Throughout the rest of this paper we shall define G via 

G = G(K*/Q) 

and H via 

H = G(K*/k). 

Let (G, H) be the normal subgroup of G generated by the elements 
(g, h) where g ∈ G and h ∈ H. 

Lemma 5 Let k be an arbitrary finite abelian extension of Q. Then 
G(K*/K) = (G, H). 

Proof Let E = G(K*/K). Let g ∈ G and h ∈ H. Then (g, h) ∈ E, i.e., 
gEhE = hEgE because H/E = G(K/k) ⊂ cen G(K/Q) = cen G/E. Thus 
(G, H) ⊂ E. Now to show E ⊂ (G, H). Let K̃ be the field fixed by (G, H). 
Since (G, H) ⊂ (G, G), it follows from Lemma 2 that K̃ ⊃ k* ⊃ k. Also K* is 
unramified over k*, which in turn is unramified over k. So K̃ is unramified 
over k. Since g(G, H)h(G, H) = h(G, H)g(G, H), we get G(K̃/k) = 
H/(G, H) ⊂ cen G/(G, H) = cen G(K̃/Q). Therefore, K̃ ⊂ K and so 
E ⊂ (G, H). 

Lemma 6 Let k be an arbitrary finite abelian extension of Q. The Hasse 
norm theorem holds for k if and only if (G, G) = (G, H). 

Proof By Theorem 1 the Hasse norm theorem holds if and only if 
K = k*, which happens if and only if (G, H) = G(K*/K) = G(K*/k*) = 
(G, G). 


3. The Structure of (G, G)/(G, H) for Fields with Properties (I)-(IV) 

In this section, for a specific type of extension of Q, we shall: (a) find a set 
of generators of H (Lemma 7), (b) find a set of generators for (G, H) (Lemma 
8), and (c) give a matrix method for determining the invariants of 
(G, G)/(G, H). 

Throughout this section we shall assume that k is an abelian extension of 
Q with the following properties: 

(I) [k : Q] is a power of l where l is any prime. 

(II) The conductor f of k is odd. 

(III) k is noncyclic. 

(IV) k is the composite of cyclic fields with pairwise relatively prime 
conductors, i.e., 

k = L_1 ··· L_t 

where each L_d (1 ≤ d ≤ t) is a cyclic extension of Q and if f_d and f_{d'} are the 
conductors of L_d and L_{d'}, respectively, and d ≠ d', then f_d and f_{d'} are relatively 
prime. 

We shall now produce X = X_k. Let S_d (1 ≤ d ≤ t) be defined via 
S_d = {i | 1 ≤ i ≤ s and p_i | f_d}. 

Without loss of generality we may assume that the p_i (1 ≤ i ≤ s) are ordered 
so that if 1 ≤ d < d' ≤ t and i ∈ S_d and i' ∈ S_{d'}, then i < i'. Let ψ_d be a 
generator of X_{L_d}. Then 

X_k = X_{L_1} × ··· × X_{L_t} = ⟨ψ_1⟩ × ··· × ⟨ψ_t⟩. 

If i ∈ S_d, let χ_i = pr_i ψ_d. As mentioned in Section 1, we now restrict χ_i^0, i.e., 
χ_i^0 must be a generator of X_i^0 having the property (χ_i^0)^{α(i)/β(i)} = χ_i. Let 
ν_i = e^{2πi/β(i)}. Then 

χ_i(p_j) = ν_i^{[p_i, p_j]}    if j ≠ i. 

Let k_i be the largest subfield of Q_{n_i} such that [k_i : Q] is a power of l. In 
Section 1 we stated that Lemmas 3 and 4 held for appropriately chosen 
σ_{i k_0} ∈ I(𝔭_i | p_i) where 𝔭_i is a prime ideal in k_0. To be more precise this means 
σ_{i k_0} generates I(𝔭_i | p_i) and σ_{i k_0} satisfies the following relation for all j 
(1 ≤ j ≤ s) [5; 2, p. 238]: 

(p_j, k_i)_{(p_i)} = σ_{i k_0}^{[p_i, p_j]}    (4) 

where (p_i) = p_i Z. (For this equality we are really thinking of σ_{i k_0} as restricted 
to k_i.) We shall now produce such a σ_{i k_0}. Let a'_i be an integer (relatively 
prime to n_i) such that χ_i^0(a'_i) = δ_i and hence χ_i(a'_i) = ν_i. Let a_i be an 
integer such that 

a_i ≡ a'_i mod n_i    and    a_i ≡ 1 mod f/n_i. 

Then 

χ_i(a_i) = ν_i.    (5) 

Let ζ = e^{2πi/f}. Let σ_{i k_0} be the automorphism on k_0 given by restricting ζ → ζ^{a_i} 
to k_0. Then σ_{i k_0} generates I(𝔭_i | p_i) and for i ≠ j we have χ_i^0(a_i^{[p_i, p_j]}) = χ_i^0(p_j). 
Therefore if j ≠ i, then 

(p_j, k_i)_{(p_i)} = (k_i/p_j) = σ_{i k_0}^{[p_i, p_j]} 

where (k_i/p_j) is the Artin symbol. If j = i, then (4) holds trivially. 

Let c(d) (1 ≤ d ≤ t) be an integer in S_d with the property that 

β(c(d)) ≥ β(i) for all i ∈ S_d. 

If i ∈ S_d, let 

μ(i) = β(c(d))/β(i). 

(Note that i determines d.) If i ∈ S_d, let 

τ_i = σ_{c(d)}^{μ(i)} σ_i^{-1}. 

Lemma 7 The automorphisms τ_i (1 ≤ i ≤ s) and the elements of (G, G) 
generate H. 

Proof Since k* ⊃ k, we get by Lemma 2 that (G, G) ⊂ H. We now show 
τ_i ∈ H where i ∈ S_d. For this proof, let b_i = a_{c(d)}^{μ(i)}. Then τ_i restricted to k is 
ζ → ζ^{b_i a_i^{-1}} restricted to k where a_i^{-1} is the multiplicative inverse of a_i in the 
integers mod f. But by (5) we get 

ψ_d(b_i a_i^{-1}) = χ_{c(d)}(a_{c(d)})^{μ(i)} χ_i(a_i)^{-1} = ν_{c(d)}^{μ(i)} ν_i^{-1} = 1. 

Thus τ_i ∈ H. 

Now we need only show that any automorphism in H can be written as a 
product composed of the τ_i and elements of (G, G). Let h ∈ H. Since h ∈ G, it 
follows from Lemma 3 that 

h = h_1 ··· h_t g 

where g ∈ (G, G) and for 1 ≤ d ≤ t 

h_d = ∏_{i ∈ S_d} σ_i^{e_i}. 

Let all products which follow in this proof run over all i ∈ S_d. Since h ∈ H 
we get h_d fixes L_d. Furthermore, h_d restricted to L_d is ζ → ζ^{m(d)} restricted to L_d 
where m(d) = ∏ a_i^{e_i}. Thus 

1 = ψ_d(m(d)) = ∏ χ_i(a_i)^{e_i} = ∏ χ_{c(d)}(b_i)^{e_i} = χ_{c(d)}(∏ b_i^{e_i}). 

Therefore ζ → ζ^{n(d)} restricted to k* is 1 where n(d) = ∏ b_i^{e_i}. Thus ∏ σ_{c(d)}^{μ(i) e_i} 
restricted to k* is 1, i.e., ∏ σ_{c(d)}^{μ(i) e_i} ∈ (G, G). So we have 

h_d(G, G) = ∏ σ_{c(d)}^{−μ(i) e_i} σ_i^{e_i} (G, G) = ∏ τ_i^{−e_i}(G, G). 

Before proving the next lemma we note that if a, b, and c are elements of 
some group E and (E, E) ⊂ cen E, then 

(a, bc) = (a, b)(a, c) and (ab, c) = (a, c)(b, c)    (6) 

= ( 7 ) 

(a, b) = (b, a)^{-1}.    (8) 
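
Identities (6) and (8) can be confirmed exhaustively in a small group whose commutators are central. The sketch below uses the group of 3 × 3 upper unitriangular matrices modulo 3, encoded as triples; this choice of group, the convention (x, y) = x^{-1} y x y^{-1}, and all names are assumptions made for illustration. When commutators are central the identities hold equally for the other common commutator conventions.

```python
from itertools import product

P = 3  # the group below has order P**3 and central commutators


def mul(x, y):
    """Product of unitriangular matrices encoded as (a, b, c)."""
    a, b, c = x
    d, e, h = y
    return ((a + d) % P, (b + e) % P, (c + h + a * e) % P)


def inv(x):
    """Inverse of (a, b, c) under mul."""
    a, b, c = x
    return (-a % P, -b % P, (a * b - c) % P)


def comm(x, y):
    """Commutator (x, y) = x^-1 y x y^-1 (assumed convention)."""
    return mul(mul(inv(x), y), mul(x, inv(y)))


GROUP = list(product(range(P), repeat=3))


def identities_hold():
    """Check (6) and (8) for every triple of group elements."""
    for a, b, c in product(GROUP, repeat=3):
        if comm(a, mul(b, c)) != mul(comm(a, b), comm(a, c)):
            return False
        if comm(mul(a, b), c) != mul(comm(a, c), comm(b, c)):
            return False
        if comm(a, b) != inv(comm(b, a)):
            return False
    return True


if __name__ == "__main__":
    print(identities_hold())
```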

Lemma 8 The following elements generate (G, H): 

(σ_{c(d)}, σ_{c(d')})^{μ(i)μ(i')} (σ_i, σ_{i'})^{-1}    (9) 


where i ∈ S_d and i' ∈ S_{d'} and 1 ≤ d ≤ d' ≤ t and i < i'. Note that if d = d' then 
(9) reduces to (σ_i, σ_{i'})^{-1}. 

Proof We first show that an element of the type (9) is in (G, H). By (6), 
(7), and Lemma 7 

, °i')- 1 = Kd) > T |( ) e (G, H). (10) 

Similarly, 

(a v ,o t ^(a v ,a i )- l s(G,H) (11) 

Therefore by (8), (10), and (11) elements of type (9) are in (G, H). 

Now we show each element of (G, H) is a product of powers of elements 
of type (9). By (6) and Lemmas 3 and 7, (G, H) is generated by (<r ; , T j( ) where 
1 < i, i' < s. Suppose i < i'. Then 

(°i . T i') = ( ff i > 1 

= [( CT c(d) > <T i'(i(')) ,i|l) ( ,T i » ff c(d')) '] M<,) 

x [(«■*» - ff t (d-))'‘ <0 '‘ <i ' ) ( <T i > °V)~ ']- 

If i < c(d'), then each element in the brackets is either of type (9) or is 1. If 
i > c(d'), then d = d' and the first component above can be written as 
Kdo> which is a power of an element of type (9). Suppose i > i'. 

Applying (8) to the right-hand side of the above shows (a ,, T f ,) is a product of 
powers of elements of type (9). 

Lemma 9 Let k satisfy (I)-(IV). Let W be the free group with \binom{s}{2} generators 
w_{ij}, 1 ≤ i < j ≤ s. Let R be the normal subgroup generated by the relators 
given in (1), (2), and (3) and by 

w_{c(d)c(d')}^{μ(i)μ(i')} w_{ii'}^{-1}    (12) 

where i ∈ S_d, i' ∈ S_{d'}, and 1 ≤ d ≤ d' ≤ t and i < i'. Then 
W/R ≅ (G, G)/(G, H). 

Proof This follows from Magnus et al. [9, p. 71] and Lemmas 4 and 8. 
We shall now produce a matrix procedure for computing the invariants 
of (G, G)/(G, H). Let r = \binom{s}{2}. Order (i, j), 1 ≤ i < j ≤ s, lexicographically, i.e., 
(1, 2), (1, 3), ..., (1, s), (2, 3), ..., (2, s), ..., (s − 1, s)    (13) 


Let this ordering determine the position of the entries in the following 
matrices. 

Let A be the r × r diagonal matrix A = diag[β(i, j)] where using the 
ordering of (13) β(i, j) is in the (i, j)th row and column. 

Let B be the s × r matrix where the ith row is obtained from (3) as 
follows. Let 1 ≤ j ≤ s, j ≠ i. If j < i, put [p_j, p_i] in the (j, i)th column. If i < j, 
put −[p_j, p_i] in the (i, j)th column. Put zero elsewhere in the ith row. 

Let E be the r × r matrix where the (i, i')th row is obtained from (12) as 
follows. Let i ∈ S_d and i' ∈ S_{d'}. If d = d', then put 1 in the (i, i')th column and 
put zeros elsewhere in the (i, i')th row. Suppose d ≠ d'. If (i, i') = (c(d), c(d')), 
put zeros everywhere in the (i, i')th row. If (i, i') ≠ (c(d), c(d')), put μ(i)μ(i') in 
the (c(d), c(d'))th column and put −1 in the (i, i')th column. Put zeros 
elsewhere in the (i, i')th row. Let O_1 be the r × (s + r) zero matrix and O_2 the 
s × (s + r) zero matrix. Let m = 2r + s. Let M be the m × m matrix 


        [ A  O_1 ] 
    M = [ B  O_2 ] 
        [ E  O_1 ] 


The Smith normal form of an $m \times m$ matrix M over Z is the unique
matrix that results from applying elementary row and column operations to
M so as to obtain an $m \times m$ diagonal matrix
\[ \operatorname{diag}(z_1, \ldots, z_c, 0, \ldots, 0) \]
where $z_i \mid z_{i+1}$ for $1 \le i \le c - 1$ and each $z_i$ is a positive integer [10, p. 26].
The $z_1, \ldots, z_c$ are the invariant factors of M.
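For small matrices the invariant factors can be computed without carrying out the row and column reduction. The sketch below uses the standard determinantal-divisor characterization instead of elementary operations: if $d_k$ is the gcd of all $k \times k$ minors, then $z_k = d_k/d_{k-1}$. The function names are ours, and the approach is exponential in the matrix size, so it is meant only as an illustration.

```python
from itertools import combinations
from math import gcd

def det(m):
    """Integer determinant by Laplace expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for col, entry in enumerate(m[0]):
        if entry:
            minor = [row[:col] + row[col + 1:] for row in m[1:]]
            total += (-1) ** col * entry * det(minor)
    return total

def invariant_factors(m):
    """Nonzero invariant factors z_1 | z_2 | ... of an integer matrix."""
    n_rows, n_cols = len(m), len(m[0])
    factors, prev = [], 1
    for k in range(1, min(n_rows, n_cols) + 1):
        d_k = 0  # gcd of all k x k minors
        for rows in combinations(range(n_rows), k):
            for cols in combinations(range(n_cols), k):
                d_k = gcd(d_k, det([[m[r][c] for c in cols] for r in rows]))
        if d_k == 0:  # all larger minors vanish as well
            break
        factors.append(d_k // prev)
        prev = d_k
    return factors

print(invariant_factors([[2, 4, 4], [-6, 6, 12], [10, -4, -16]]))
```

The invariant factors different from 1 are the ones that matter in Theorem 3.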

Theorem 3 Let k satisfy (I)-(IV). Let M be the $m \times m$ matrix described
above. Then there are precisely r invariant factors of M, namely, $z_1, \ldots, z_r$.
Suppose $z_1 = \cdots = z_e = 1$ and $z_{e+1} \ne 1$. Then $z_{e+1}, \ldots, z_r$ are the invariants
of $(G, G)/(G, H) \cong G(K/K')$.

Proof The same proof as was given in Garbanati [5, Theorem 11] works 
if we replace A/A a R by ( G, G)/(G, H). 

If $b \in Z$, let $\bar b$ be b mod l. If C is a matrix over Z, let $\bar C$ be the matrix over
$F_l$ (the Galois field with l elements) obtained by replacing each entry b of C
with $\bar b$.

Let $C = \begin{bmatrix} \bar B \\ \bar E \end{bmatrix}$.


Corollary 1 Let k satisfy (I)-(IV). Then (G, G)/(G, H) is the trivial group 
if and only if rank C = r. 

Proof Let $U_1$ and $U_2$ be unimodular matrices such that $U_1 M U_2 =
\operatorname{diag}(z_1, \ldots, z_r) \dotplus O_3$ where $O_3$ is the $(s + r) \times (s + r)$ zero matrix. Then
(G, G)/(G, H) is trivial $\iff e = r \iff \operatorname{rank} \overline{U_1 M U_2} = r \iff \operatorname{rank} C = r$.

4. The Hasse Norm Theorem for Fields with Properties (I)-(IV) 

In the following theorem we find a new matrix D over $F_l$ such that
rank D = $\binom{t}{2}$ if and only if rank C = r and then we use Lemma 6 and
Corollary 1 to give a necessary and sufficient condition for the Hasse norm
theorem to hold for k in terms of the rank of D.

We now define D. Let k satisfy (I)-(IV). In particular
\[ k = L_1 \cdots L_t \]
is the composite of t cyclic extensions of Q (namely, $L_d$ ($1 \le d \le t$)) which
have pairwise relatively prime conductors. Let $S = \{p_1, \ldots, p_q\}$ be those
primes which ramify totally in some $L_d$ ($1 \le d \le t$). Note we are not fixing d
here, i.e., perhaps $p_1$ ramifies totally in $L_d$ and $p_2$ ramifies totally in $L_{d'}$ where
$d \ne d'$. Let $\psi_d$ generate $X_{L_d}$, i.e.,
\[ X_{L_d} = \langle \psi_d \rangle \]
where $X_{L_d}$ is the character group associated with $L_d$. Let $a_d = [L_d : Q]/l$. Let
$\omega = e^{2\pi i/l}$. Let $f_d$ be the conductor of $L_d$. Let
\[ S_d = \{ j \mid p_j \in S \text{ and } p_j \mid f_d \}. \]
If $p_j \in S$ but $j \notin S_d$, define $a_{dj}$ ($0 \le a_{dj} < l$) via
\[ \psi_d^{a_d}(p_j) = \omega^{a_{dj}}. \]
Let $u = \binom{t}{2}$. Order the u pairs (d, d') where $1 \le d < d' \le t$ lexicographically
(cf. (13)) and use this ordering for the columns of the following $q \times u$ matrix
D (over $F_l$). The jth row of D is determined as follows. If $j \in S_d$ and $1 \le d < d' \le t$,
put $-a_{d'j}$ in the (d, d')th column. If $j \in S_{d'}$ and $1 \le d < d' \le t$, put $a_{dj}$
in the (d, d')th column. Put zeros elsewhere in the jth row.

Theorem 4 Let k satisfy (I)-(IV). Let D be the matrix over $F_l$ defined
above. Then the Hasse norm theorem holds for k if and only if rank D = u.

Proof We shall show that
\[ \operatorname{rank} D = u \iff \operatorname{rank} C = r \tag{14} \]

and then the theorem will follow from Lemma 6 and Corollary 1. We shall 
now prove (14) by performing a series of row and column operations on C 
and then simplifying and analyzing the matrix that results. In what follows 
let ERO (resp. ECO) stand for elementary row (resp. column) operation. 
Consider the E part of C. We use the notation employed to define E. For
the (i, i')th row of E, there are two cases. Either $d = d'$ or $d \ne d'$. If $d = d'$,
then use the 1 in the (i, i')th row and column of E to change (via ERO) all
other entries in the (i, i')th column of C to 0. If $d \ne d'$ and $(i, i') \ne
(c(d), c(d'))$, change the i'th row of B as follows. Multiply the (i, i')th row of E
by $[p_i, p_{i'}]$ and add the resulting row to the i'th row of B. Also change the ith
row of B as follows. Multiply the (i, i')th row of E by $-[p_{i'}, p_i]$ and add to
the ith row of B. At this point if $(i, i') \ne (c(d), c(d'))$ and $d \ne d'$ the (i, i')th
column of C will contain $-1$ in the (i, i')th row of E of C and zeros
elsewhere. Use this $-1$ (via ECO) to change to 0 the $\mu(i)\mu(i')$ in the (i, i')th row
and (c(d), c(d'))th column of E. Now change this $-1$ to $+1$ (via ERO).
Change the (c(d), c(d')) column (via ECO) into a (d, d') column, i.e., move it
to the (d, d') column position. Since $1 \le d < d' \le t$, there are u pairs (d, d').
We have now transformed C into the direct sum of two matrices $B_0$ and $E_0$,
i.e., $C = B_0 \dotplus E_0$ where $B_0$ is an $s \times u$ matrix and $E_0$ is an $r \times (r - u)$
matrix. But $E_0$ can be transformed (via EROs and ECOs) into $\begin{bmatrix} I_{r-u} \\ O \end{bmatrix}$ where
$I_{r-u}$ is the $(r - u) \times (r - u)$ identity matrix and O is the $u \times (r - u)$ zero
matrix. We now have
\[ \operatorname{rank} C = r \iff \operatorname{rank} B_0 = u. \tag{15} \]

What does the jth row of $B_0$ look like? If $j \in S_{d'}$, then in the (d, d')th
column ($d < d'$) we have
\[ \sum_{i \in S_d} \mu(i)\mu(j)[p_i, p_j]. \tag{16} \]
If $j \in S_d$, then in the (d, d')th column ($d < d'$) we have
\[ \sum_{i' \in S_{d'}} -\mu(j)\mu(i')[p_{i'}, p_j]. \tag{17} \]
If $j \notin S_d \cup S_{d'}$, then we have 0 in the (d, d')th column.

Let $j \in S_h$ where $1 \le h \le t$, i.e., $p_j \mid f_h$. Since the ramification index of $p_j$ in
$L_h$ is $o(\chi_j) = \beta(j)$ [8, p. 358] and since $[L_h : Q] = o(X_{L_h}) = \beta(c(h))$, we get that
$p_j$ ramifies totally in the cyclic field $L_h$ if and only if $\beta(c(h)) = \beta(j)$ if and only
if $\mu(j) = 1$.

Consider $B_0$. If $p_j$ does not ramify totally in $L_h$ where $j \in S_h$, then by (16)
and (17) the jth row of $B_0$ is the zero vector. If $p_j$ ramifies totally in $L_h$ where
$j \in S_h$ (i.e., $\mu(j) = 1$), and if $j \notin S_d$, then

hpj )=n xApj )=n w] =n pj] 

where these products and the product that follows run over all i e S d . Let 
a d — P(c(d))/l. Then v% d) = a> where at = e 2nil ‘. Thus 

w=n^ )[f " w| . 


( 18 ) 
If $j \notin S_d$, define $a_{dj}$ ($0 \le a_{dj} < l$) via
\[ \psi_d^{a_d}(p_j) = \omega^{a_{dj}}. \tag{19} \]
By (18) and (19)
\[ a_{dj} = \sum_{i \in S_d} \mu(i)[p_i, p_j] \bmod l. \tag{20} \]

By (20) $B_0$ is in fact the $s \times u$ matrix given by the following prescription for
the jth row. If $p_j$ does not ramify totally in $L_h$ where $j \in S_h$, then let the jth
row be the zero vector. By (16), (17), and (20) we have the following. If $p_j$
ramifies totally in $L_{d'}$ where $j \in S_{d'}$, then in the (d, d')th column ($d < d'$) we
have $a_{dj}$. If $p_j$ ramifies totally in $L_d$ where $j \in S_d$, then in the (d, d')th column
we have $-a_{d'j}$. If $p_j$ ramifies totally in $L_h$ where $j \in S_h$ but $j \notin S_d \cup S_{d'}$, then
we have 0 in the (d, d')th column. Hence rank $B_0$ = rank D and we are done,
i.e., (14) now follows from (15).
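Once the entries $a_{dj}$ are known, the criterion of Theorem 4 is a rank computation over $F_l$. The following is a minimal sketch of such a computation by Gaussian elimination modulo a prime; the function name `rank_mod_p` is ours, and the two sample matrices use hypothetical values of $a_{dj}$ (with t = 3, q = 3, l = 3 and $p_j$ ramifying totally in $L_j$), not values taken from any particular field.

```python
def rank_mod_p(matrix, p):
    """Rank of an integer matrix reduced modulo the prime p."""
    rows = [[x % p for x in row] for row in matrix]
    n_cols = len(rows[0]) if rows else 0
    rank = 0
    for col in range(n_cols):
        # Find a row at or below position `rank` with a nonzero entry here.
        pivot = next((r for r in range(rank, len(rows)) if rows[r][col]), None)
        if pivot is None:
            continue
        rows[rank], rows[pivot] = rows[pivot], rows[rank]
        inv = pow(rows[rank][col], -1, p)  # modular inverse (Python 3.8+)
        rows[rank] = [(x * inv) % p for x in rows[rank]]
        for r in range(len(rows)):
            if r != rank and rows[r][col]:
                factor = rows[r][col]
                rows[r] = [(x - factor * y) % p
                           for x, y in zip(rows[r], rows[rank])]
        rank += 1
    return rank

# Columns are ordered (1,2), (1,3), (2,3); row j follows the rule for D.
# Hypothetical entries: every a_dj equal to 1.
D_singular = [[-1, -1, 0], [1, 0, -1], [0, 1, 1]]
# Same, except a_23 = 2.
D_full = [[-1, -1, 0], [1, 0, -1], [0, 1, 2]]
print(rank_mod_p(D_singular, 3), rank_mod_p(D_full, 3))
```

In the first sample the rank is below u = 3, so under Theorem 4 the Hasse norm theorem would fail for a field with those entries; in the second it would hold.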

Let T be the set of all $p_j$ ($1 \le j \le s$) that ramify totally in $L_c$ for some c
($1 \le c \le t$) (i.e., $p_j \in S$) and which, in addition, split in $L_d$ for all $d \ne c$. Let
#(·) stand for “the number of elements in.”

Theorem 5 Let k satisfy (I)-(IV). If $q - \#(T) < u$, then the Hasse norm
theorem does not hold for k.

Proof If $p_j \in T$, then $p_j \in S$ and so $p_j$ determines a row in D. Let $g_j$ be
the number of primes in $L_d$ that divide $p_j$. Then $g_j$ is the order of the set of all
$\chi \in X_{L_d}$ such that $p_j$ does not divide the conductor of $\chi$ and simultaneously
$\chi(p_j) = 1$ [8, p. 356]. Therefore since $p_j$ splits in each $L_d$ where $d \ne c$, we have
$\psi_d^{a_d}(p_j) = 1$ for all $d \ne c$. Therefore the row determined by $p_j$ is the zero row
and so there are at most $q - \#(T)$ nonzero rows in D. The result now
follows from Theorem 4.

The following theorem generalizes Garbanati [5, Theorem 10]. 

Theorem 6 Let k satisfy (I)-(IV). Let k be the composite of t cyclic
extensions of Q, namely $L_d$ ($1 \le d \le t$), which have pairwise relatively prime
conductors. Let q be the number of primes that ramify totally in some $L_d$
($1 \le d \le t$). If $q < \binom{t}{2}$, the Hasse norm theorem does not hold.

Proof This is immediate from Theorem 4. 

Example Let $p_1, \ldots, p_5$ be distinct odd primes. Let
\[ k = Q((\varepsilon_1 p_1 p_2)^{1/2}, (\varepsilon_2 p_3)^{1/2}, (\varepsilon_3 p_4)^{1/2}, (\varepsilon_4 p_5)^{1/2}) \]
where $\varepsilon_i = \pm 1$ ($1 \le i \le 4$) and $\varepsilon_1 p_1 p_2 \equiv 1 \bmod 4$, $\varepsilon_i p_{i+1} \equiv 1 \bmod 4$
($2 \le i \le 4$). Then k satisfies (I)-(IV). Since q = 5 and u = 6, the Hasse norm
theorem does not hold for k.

Corollary 2 Let k satisfy (I)-(IV). Let k be the composite of four or more
cyclic extensions $L_d$ of Q that have pairwise relatively prime conductors. If
precisely one prime ramifies totally in each $L_d$, then the Hasse norm theorem
does not hold for k.

Proof In this case $q = t \ge 4$, but then $q < \binom{t}{2}$.
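The counting argument behind Theorem 6 and Corollary 2 is easy to check numerically. The sketch below (the function name is ours) tests the inequality $q < \binom{t}{2}$, which forces rank D < u because D has only q rows. Note that it gives a sufficient condition for failure only; when the inequality does not hold, the rank of D still has to be computed.

```python
from math import comb

def fails_by_counting(q, t):
    """True when q < C(t, 2), so the q x u matrix D cannot have rank u."""
    return q < comb(t, 2)

# The example above: five totally ramified primes, four cyclic fields.
print(fails_by_counting(5, 4))
# Corollary 2: one totally ramified prime per field, t >= 4.
print(all(fails_by_counting(t, t) for t in range(4, 50)))
# For t = 3 and q = 3 the count alone decides nothing.
print(fails_by_counting(3, 3))
```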

The following theorem generalizes Garbanati [5, Theorem 8].

Theorem 7 Let k satisfy (I)-(II). Let $k = L_1 L_2$ be the composite of two
cyclic extensions of Q (namely $L_1$ and $L_2$) that have relatively prime
conductors. Then the Hasse norm theorem holds if and only if there exists a prime
$p \in Q$ that ramifies totally in either $L_1$ or $L_2$ and at the same time does not split
in k (i.e., only one prime in k divides p).

Proof Let $p_1, \ldots, p_q$ be the primes that ramify totally in either $L_1$ or $L_2$.
We can assume $p_1, \ldots, p_n$ ramify totally in $L_1$ and $p_{n+1}, \ldots, p_q$ ramify totally
in $L_2$. Then D is the transpose of the matrix
\[ [\,-a_{21} \;\; -a_{22} \;\; \cdots \;\; -a_{2n} \;\; a_{1,n+1} \;\; \cdots \;\; a_{1q}\,]. \]
By Theorem 4 the Hasse norm theorem holds if and only if one of the entries
is not zero, i.e., either $\psi_2^{a_2}(p_j) \ne 1$ for some j where $1 \le j \le n$ or $\psi_1^{a_1}(p_j) \ne 1$
for some j where $n + 1 \le j \le q$ and where $a_d = [L_d : Q]/l$ ($1 \le d \le 2$). Let $g_j$
be the number of primes in k that divide $p_j$. As in the proof of Theorem 5 it
follows from Leopoldt [8, p. 356] that if $1 \le j \le n$, then $g_j = 1$ if and only if
$\psi_2^{a_2}(p_j) \ne 1$. If $n + 1 \le j \le q$, then $g_j = 1$ if and only if $\psi_1^{a_1}(p_j) \ne 1$. The result
now follows.

Such a theorem produces numerous examples where the Hasse norm 
theorem holds in the noncyclic case. 

Example Let $k = Q((mp_1)^{1/2}, p_2^{1/2})$ where $p_1$ and $p_2$ are distinct primes
and where m is an integer not divisible by $p_1$ or $p_2$. Suppose $mp_1 \equiv 1 \bmod 4$
and $p_2 \equiv 1 \bmod 4$. If $(p_1/p_2) = -1$, the Hasse norm theorem holds for k
(where $(\cdot/\cdot)$ is the Legendre symbol). This is because $p_1$ does not split in k if
and only if $(p_1/p_2) = -1$.
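The hypotheses of this example are simple congruence conditions, so concrete instances are easy to find. The sketch below computes the Legendre symbol by Euler's criterion and tests the stated conditions; the function names are ours, and the code checks only the arithmetic conditions of the example, not properties (I)-(IV) themselves.

```python
def legendre(a, p):
    """Legendre symbol (a/p) for an odd prime p, via Euler's criterion."""
    r = pow(a, (p - 1) // 2, p)
    return -1 if r == p - 1 else r

def satisfies_example(m, p1, p2):
    """Check m*p1 = 1 mod 4, p2 = 1 mod 4, p1, p2 not dividing m, (p1/p2) = -1."""
    return (m % p1 != 0 and m % p2 != 0
            and (m * p1) % 4 == 1
            and p2 % 4 == 1
            and legendre(p1, p2) == -1)

# k = Q(sqrt(-3), sqrt(5)): m = -1, p1 = 3, p2 = 5.
print(satisfies_example(-1, 3, 5))
# k = Q(sqrt(5), sqrt(13)): m = 1, p1 = 5, p2 = 13.
print(satisfies_example(1, 5, 13))
```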

Corollary 3 Let k satisfy (I)-(IV). Using the notation of Theorem 6 let
$k = L_1 \cdot L_2$ and suppose $[L_i : Q] = l$ for i = 1, 2. The Hasse norm theorem
holds for k if and only if there is a prime dividing the conductor of k that does
not split in k.

For more information pertaining to this corollary, see A. Scholz [12]. 

Theorem 8 Let k satisfy (I)-(IV). By (IV) $k = L_1 \cdots L_t$. If for some pair
$\{L_i, L_j\}$ where $1 \le i < j \le t$ we have that each prime p that ramifies totally in
either $L_i$ or $L_j$ splits in the composite $L_i \cdot L_j$, then the Hasse norm theorem does
not hold for k.

Proof This is immediate from Theorems 2 and 7. 
Theorem 9 Let k satisfy (I)-(IV). Using the notation of (IV) suppose
$k = L_1 \cdot L_2 \cdot L_3$. Let q = 3. If for some i where $1 \le i \le 3$ we have for all $j \ne i$
that $p_j$ splits in $L_i$, then the Hasse norm theorem does not hold for k.

Proof Since in this case
\[ \det D = a_{31}a_{12}a_{23} - a_{21}a_{32}a_{13} \]
it follows (as in the proof of Theorem 5) from [8, p. 356] that under the
hypotheses of the theorem det D = 0.
For an algebraic number field E, let $\mathcal{O}_E$ denote the ring of integers of E
and $U_E$ the units of $\mathcal{O}_E$. Let K be an imaginary quadratic number field,
$K = Q(\sqrt{-D})$, where D is a positive square-free integer. Let F be a totally
real extension of Q and L = KF the compositum of K and F. In this paper
we consider the following two types of scalar extensions of binary lattices in
K.

First, if $\mathfrak{A}$ is a fractional ideal of K, we can form the fractional ideal $\mathfrak{A}\mathcal{O}_L$
of L.

Second, suppose $\mathfrak{L}$ is a lattice in a definite binary quadratic space V over
Q having discriminant $-D$ (modulo squares). Then V is similar to the
quadratic space K with quadratic form $n(x) = x x^J$, where J is the complex
conjugation automorphism of K (cf. [1, §5.1]). Hence we may identify V with
a quadratic space having underlying space K and quadratic form $c \cdot n$ for a
suitable rational number c. We can form the tensor product $\tilde{\mathfrak{L}} = \mathfrak{L} \otimes_Z \mathcal{O}_F$.
Then $\tilde{\mathfrak{L}}$ may be regarded as an $\mathcal{O}_F$-submodule of L, but it need not be a
fractional ideal of L, even if $\mathfrak{L}$ is a fractional ideal of K.

For each type of scalar extension, we have an appropriate notion of 
equivalence. In the first case it is the usual equivalence ~ of fractional 
ideals; in the second it is isometry between quadratic lattices. In this paper
we present two theorems on the behavior of these equivalences with respect 
to the corresponding types of scalar extension. The first theorem generalizes 
a result of Kitaoka [3, Corollary to Theorem 2]. The second is the binary 
case of Theorem 2 in Kitaoka [3]. Our method is Galois-theoretic, and as 
such, differs completely from Kitaoka’s, which relies on a study of the 
minima of positive definite lattices upon totally real extension. 

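Let the group of roots of unity in L be denoted by $\mathcal{W}$. If F is a Galois
extension of Q, we shall identify Gal(L/K) with Gal(F/Q) and denote it by G.
Since J commutes with all complex embeddings of F, $\varepsilon^{1-J}$ is a root of unity
for any unit $\varepsilon$ of L (cf. [2, p. 53]). Denote the subgroup of all roots of unity in
L of this form by $\mathcal{V}$. Let $v = \operatorname{card}(\mathcal{V})$, $w = \operatorname{card}(\mathcal{W})$. Since $\mathcal{V}$ contains all
squares of elements in $\mathcal{W}$, we have either w = v or w = 2v.

Lemma 1 Suppose F is Galois over Q. Let $\mathfrak{A}$ be a fractional ideal of K
such that $\mathfrak{A}\mathcal{O}_L$ is principal. Then $\mathfrak{A}^v \sim \mathfrak{P}_1 \cdots \mathfrak{P}_s$, where the $\mathfrak{P}_i$, i = 1, ..., s,
are distinct primes of K that ramify over Q.

Proof By assumption, $\mathfrak{A}\mathcal{O}_L = \alpha\mathcal{O}_L$, $\alpha \in L$. For each $\sigma \in G$ we have
$\alpha^\sigma\mathcal{O}_L = (\mathfrak{A}\mathcal{O}_L)^\sigma = \mathfrak{A}\mathcal{O}_L = \alpha\mathcal{O}_L$, from which it follows that $\alpha^{1-\sigma} = \varepsilon$, $\varepsilon \in U_L$.
Then $(\alpha^{1-J})^{1-\sigma} = \varepsilon^{1-J}$, $\varepsilon^{1-J} \in \mathcal{V}$. Putting $\beta = \alpha^v$, $\mathfrak{B} = \mathfrak{A}^v$, we see that
$(\beta^{1-J})^{1-\sigma} = 1$, so that $(\beta^{1-J})^\sigma = \beta^{1-J}$ for all $\sigma \in G$. It follows that $\beta^{1-J} \in K$
and $\mathfrak{B}^{1-J} = \beta^{1-J}\mathcal{O}_K$. Thus $\mathfrak{B} \sim \mathfrak{B}^J$, which implies $\mathfrak{B} \sim \mathfrak{P}_1 \cdots \mathfrak{P}_s$, where
the $\mathfrak{P}_i$, i = 1, ..., s, are distinct primes of K ramified over Q.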

Lemma 2 Let $\mathcal{O}$ be an order of K (not necessarily maximal). Then
$\tilde{\mathcal{O}} \cap U_L = U_F$ except if $\mathcal{O} = \mathcal{O}_K$ when D = 1, 3.

Proof We can write $\mathcal{O} = Z \cdot 1 + Z \cdot f\omega$, where f is a positive integer and
$\omega = \sqrt{-D}$ or $\omega = (1 + \sqrt{-D})/2$, according as $-D \equiv 2, 3 \pmod 4$ or
$-D \equiv 1 \pmod 4$, respectively. Suppose first that $\omega = \sqrt{-D}$ and
$\varepsilon \in \tilde{\mathcal{O}} \cap U_L$. Write $\varepsilon = a + bf\sqrt{-D}$, $a, b \in \mathcal{O}_F$. Assume $\varepsilon \notin F$. Then
$\varepsilon - \varepsilon^J = 2bf\sqrt{-D}$, so that $(1 - \varepsilon^{J-1})\mathcal{O}_L = (\varepsilon - \varepsilon^J)\mathcal{O}_L = (2bf\sqrt{-D})\mathcal{O}_L$. Now
$\varepsilon^{J-1} = \zeta$, a root of unity, say a primitive mth root of unity. Then
\[ (1 - \zeta)\mathcal{O}_L = (2bf\sqrt{-D})\mathcal{O}_L. \tag{1} \]
If m is not a prime power, then $1 - \zeta$ is a unit [4, p. 267], contradicting the
fact that 2 divides the right-hand side of (1). Hence $m = p^t$, p a prime. Since 2
divides the right-hand side of (1), p = 2. But then
\[ (1 - \zeta)^{\varphi(2^t)}\mathcal{O}_L = 2\mathcal{O}_L, \]
where $\varphi$ is the Euler function. Hence $\varphi(2^t) = 1$, t = 1. Then $b \in U_F$, f = 1,
D = 1, which implies $\zeta = -1$, $\varepsilon = b\sqrt{-1}$, $\mathcal{O} = \mathcal{O}_K$.

Suppose now that $\omega = (1 + \sqrt{-D})/2$ and $\varepsilon \in \tilde{\mathcal{O}} \cap U_L$, $\varepsilon \notin U_F$. If
$\varepsilon = a + bf\omega$, then
\[ (1 - \zeta)\mathcal{O}_L = (bf\sqrt{-D})\mathcal{O}_L, \tag{2} \]
where again $\zeta = \varepsilon^{J-1}$, a primitive mth root of unity. Since $D \equiv 3 \pmod 4$,
the right-hand side of (2) $\ne \mathcal{O}_L$, so $m = p^t$, p a prime, and
\[ (1 - \zeta)^{\varphi(p^t)}\mathcal{O}_L = p\mathcal{O}_L. \]
It follows that p is the only rational prime which can divide D. Hence D = p,
$b \in U_F$, f = 1. Furthermore, $\varphi(p^t) = (p - 1)p^{t-1} = 2$, which implies p = 3,
t = 1. Thus C = (— 1 ± V=3V2, e = K, and G = (9 K , completing the proof. 

Theorem 1 Let F be a totally real Galois extension of Q such that v is
relatively prime to [F : Q]. Let $\mathfrak{A}$ be a nonprincipal ideal of K. Then $\mathfrak{A}\mathcal{O}_L$ is
principal if and only if there exist distinct primes $\mathfrak{P}_1, \ldots, \mathfrak{P}_s$ of K ramified over
Q, $\mathfrak{P}_i^2 = p_i\mathcal{O}_K$ for rational primes $p_i$, i = 1, ..., s, and an element $a \in F$, such
that
\[ (a^2)\mathcal{O}_F = (p_1 \cdots p_s)\mathcal{O}_F. \]
In this case, the class of $\mathfrak{A}$ has order 2 and $p_1, \ldots, p_s$ ramify in both K and F.

Proof We need to prove the “only if” part of the statement, as the other
is trivial. By Lemma 1 we know $\mathfrak{A}^v \sim \mathfrak{P}_1 \cdots \mathfrak{P}_s$ for distinct primes $\mathfrak{P}_1, \ldots,
\mathfrak{P}_s$ of K ramified over Q. If $\mathfrak{A}\mathcal{O}_L$ is principal, then $\mathfrak{A}^m \sim (1)_K$, where
m = [L : K] = [F : Q]. Since (m, v) = 1, we have $\mathfrak{A} \sim \mathfrak{A}^{vn} \sim (\mathfrak{P}_1 \cdots \mathfrak{P}_s)^n$ for
some integer n. By assumption, $\mathfrak{A}$ is nonprincipal, so n, v must both be odd,
and we must have $\mathfrak{A} \sim \mathfrak{A}^v \sim \mathfrak{P}_1 \cdots \mathfrak{P}_s$. We may assume that
$\mathfrak{A} = \mathfrak{P}_1 \cdots \mathfrak{P}_s$. Then $\mathfrak{A}^v = (p_1 \cdots p_s)^l\,\mathfrak{P}_1 \cdots \mathfrak{P}_s$, where $l = (v - 1)/2$. From

the proof of Lemma 1, we have 9F = fl(9 K , where But 

( 9 jt;)i-j _ SQ pi-j g since K has class number > 1, U K = { + 1}. 
Thus $\beta^J = \pm\beta$. If $\beta^J = \beta$, take $a = (p_1 \cdots p_s)^{-l}\beta$. If $\beta^J = -\beta$, take
$a = c\sqrt{-D}\,\beta$ where $c \in Q$ is chosen so that $a^2$ is square-free.

Remark The assumption on v in the statement of Theorem 1 is
necessary. This can be seen by means of the following example. Let
$K = Q(\sqrt{-23})$, a field of class number 3. Consider the cubic extension
E = Q(a), where $4a^3 - 9a - 2 = 0$. Since the discriminant of the latter
polynomial is $144 \cdot 69$, E is totally real and its normal closure F is given by
$F = E(\sqrt{69})$. Note that L = KF contains a primitive cube root of unity. Put
$\alpha = a + b\sqrt{-23}$, where $b = (4a^2 - 3)^{-1}$. One easily checks that $\alpha^3 = 2 +
\sqrt{-23}$. Put $\mathfrak{A}$ = the fractional ideal of K generated by 3 and
$(1 - \sqrt{-23})/2$. Then $\mathfrak{A}^3 = (2 + \sqrt{-23})\mathcal{O}_K$, so $\mathfrak{A}$ represents an ideal class
of order 3 and $\mathfrak{A}\mathcal{O}_L = \alpha\mathcal{O}_L$. Observe that in this case w = 6 = [F : Q], which
implies (v, [F : Q]) > 1.
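The two computational claims in the Remark, the value of the discriminant and the identity $\alpha^3 = 2 + \sqrt{-23}$, can be checked numerically. The sketch below is only a floating-point sanity check, not a proof; it locates one real root a of $4a^3 - 9a - 2$ by Newton's method (the starting point 1.6 is our choice) and uses the usual formula $-4AC^3 - 27A^2D^2$ for the discriminant of $Ax^3 + Cx + D$.

```python
from math import sqrt

def real_root(x=1.6, steps=50):
    """Newton iteration for a real root of 4x^3 - 9x - 2."""
    for _ in range(steps):
        x -= (4 * x**3 - 9 * x - 2) / (12 * x**2 - 9)
    return x

# Discriminant of 4x^3 - 9x - 2 (A = 4, C = -9, D = -2).
disc = -4 * 4 * (-9) ** 3 - 27 * 4**2 * (-2) ** 2
print(disc, 144 * 69)

a = real_root()
b = 1 / (4 * a * a - 3)
alpha = complex(a, b * sqrt(23))   # alpha = a + b*sqrt(-23)
print(alpha**3)                    # close to 2 + sqrt(23) i
print(abs(alpha) ** 2)             # close to 3, since the norm of alpha^3 is 27
```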
Corollary 1 (Kitaoka [3]) Let F be a totally real extension of Q with
discriminant relatively prime to that of K. Then every nonprincipal ideal of K
remains nonprincipal in L.

Proof The discriminant of the normal closure of F must also be
relatively prime to that of K. Hence we may assume that F is Galois over Q. If K
has a nonprincipal ideal, then $D \ne 1, 3$, which implies, by Lemma 2, that
$\tilde{\mathcal{O}}_K \cap U_L = U_F$. However, since K, F have relatively prime discriminants,
$\tilde{\mathcal{O}}_K = \mathcal{O}_L$. Thus $U_L = U_F$, v = 1, and Theorem 1 is applicable. As there are
no primes ramifying in both K and F, every nonprincipal ideal of K remains
nonprincipal in L.

Corollary 2 Let $\zeta_p$ be a primitive pth root of unity, where p is a prime $\equiv
3 \pmod 4$. Then every nonprincipal ideal of $Q(\sqrt{-p})$ remains nonprincipal in
the extension $Q(\zeta_p)$.

Proof In this case $K = Q(\sqrt{-p})$, F = the maximal real subfield of
$Q(\zeta_p)$, $L = Q(\zeta_p)$. We observe that $[F : Q] = (p - 1)/2$ and w = 2p. Hence
(v, [F : Q]) = 1 and Theorem 1 is applicable. The result follows from the fact
that K has odd class number.

Theorem 2 (Kitaoka [3]) Let $\mathfrak{L}$, $\mathfrak{M}$ be definite binary quadratic lattices
over Z and F a totally real extension of Q. If E is an isometry of $\mathfrak{L} \otimes_Z \mathcal{O}_F$ onto
$\mathfrak{M} \otimes_Z \mathcal{O}_F$, then $E(\mathfrak{L}) = \mathfrak{M}$.

Proof The underlying quadratic number fields of $\mathfrak{L}$, $\mathfrak{M}$ are determined
by their discriminants. If $\tilde{\mathfrak{L}}$, $\tilde{\mathfrak{M}}$ are isometric, then $\mathfrak{L}$, $\mathfrak{M}$ have the same
discriminants. Hence we may assume that both are lattices in a single
imaginary quadratic number field $K = Q(\sqrt{-D})$. Furthermore, after scaling
suitably, we may assume that the quadratic form on $\mathfrak{L}$ is the norm form n of
K, while the quadratic form on $\mathfrak{M}$ is $c \cdot n$, c a positive rational number.
Composing E with J, if necessary, we may take E to be proper. Then E is a
proper similitude of norm c on the quadratic space (K, n). Any such proper
similitude is multiplication by a nonzero element of L [1, p. 28]. Thus
$\tilde{\mathfrak{M}} = \alpha\tilde{\mathfrak{L}}$ with $n(\alpha) = \alpha\alpha^J = c$. It suffices to show that $\alpha \in K$. From $\tilde{\mathfrak{M}} = \alpha\tilde{\mathfrak{L}}$ it
follows that $\mathfrak{L}$, $\mathfrak{M}$ have the same order $\mathcal{O}$. We may as well assume that
$\mathcal{O} \ne \mathcal{O}_K$ when D = 1 or 3. Furthermore, we may suppose that F is Galois
over Q. Imitating the proof of Lemma 1, we see that $\alpha^{1-\sigma} \in \tilde{\mathcal{O}} \cap U_L$ for all
$\sigma \in G$. Thus $\alpha^{1-\sigma} \in U_F$ for all $\sigma \in G$, by Lemma 2. This implies $\alpha^{1-J} \in K$,
which, in turn, implies $\alpha^2 \in K$. If $\alpha \in K$, we are done. If $\alpha \notin K$, then $K(\alpha)$ is a
quadratic extension of K and, as a consequence of the Galois structure of L
over Q, $K(\alpha)$ is the compositum of K and a real quadratic field $Q(\sqrt{r}) \subseteq F$.
Thus we may as well assume $F = Q(\sqrt{r})$, $L = K(\alpha)$. Let $\tau$ be the nontrivial
element of G. Then $\alpha^\tau = -\alpha$. Suppose $\mathfrak{L} = Z\lambda_1 + Z\lambda_2$, $\mathfrak{M} = Z\mu_1 + Z\mu_2$.
Then $\tilde{\mathfrak{M}} = \alpha\tilde{\mathfrak{L}}$ implies there exist $a_1, a_2, a_3, a_4 \in \mathcal{O}_F$ such that
\[ a_1\mu_1 + a_2\mu_2 = \alpha\lambda_1, \qquad a_3\mu_1 + a_4\mu_2 = \alpha\lambda_2 \tag{3} \]
with $a_1a_4 - a_2a_3 \in U_F$. Applying $\tau$ to the equations (3) and adding, we
obtain $a_i^\tau = -a_i$, i = 1, 2, 3, 4. Hence $a_i = b_i\sqrt{r}$ for some $b_i \in Z$, i = 1, 2, 3,
4. Then $a_1a_4 - a_2a_3 = r(b_1b_4 - b_2b_3)$, which implies r = 1, a
contradiction.
Any cubic extension of a number field is shown to be the resolvent cubic field of a generically
irreducible quartic polynomial whose Galois group is the alternating group $A_4$. This is applied to
the question of which groups can occur as Galois groups in finite-dimensional division algebras.


Introduction 

Suppose $k \subseteq L$ are algebraic number fields. Following the terminology of
Schacher [2] we shall say L is k-adequate if L is a maximal subfield of a
finite-dimensional division algebra with center k. A finite group G will be
called k-admissible if G is the Galois group of a normal extension L of k with
L k-adequate.

In Schacher [2] it was asked whether the alternating group $A_4$ is
Q-admissible, Q the field of rational numbers. We show in this note that in fact
$A_4$ is k-admissible for all algebraic number fields k. This will follow from a
general construction which realizes any cubic field as the resolvent cubic
field of a covering quartic whose Galois group is $A_4$ and which satisfies
prescribed local irreducibility conditions. We also study more closely the
groups that are admissible over all number fields, with particular emphasis
on the structure of their Sylow subgroups.

† This work was supported under grant GP-33580, Basil Gordon, and GP-28696, Murray
Schacher.

For notational purposes, a division ring D finite-dimensional and central
over a field k will be called a k-division ring. Throughout this paper we use
freely the theory of valuations or primes for algebraic number fields, and the
theory of Hasse invariants of k-division rings. If k is a number field and q a
prime of k, we write $k_q$ for the completion of k at q. The order of a finite
group G is denoted by |G|.


1. Groups Admissible over All Number Fields 


It was proved in Schacher [2, 4.1] that all Q-admissible groups have 
metacyclic Sylow subgroups. The following theorem is of the same type, but 
with a stronger hypothesis and a stronger conclusion. 

Theorem 1.1 If a finite group G is k-admissible for all algebraic number 
fields k, then every Sylow subgroup of G is abelian and generated by two 
elements. 

Proof Suppose G satisfies the hypothesis of Theorem 1.1 and P is a
Sylow subgroup of G, $|P| = p^a$, p a prime. We let $k = Q(\varepsilon_{p^a})$ where $\varepsilon_{p^a}$ is a
primitive $p^a$th root of unity. Since G is k-admissible, we have by [2, 2.6] G is
the Galois group of a normal extension L of k, so that P is a subgroup of the
local Galois groups $\operatorname{Gal}(L_{q_1}/k_{q_1})$ and $\operatorname{Gal}(L_{q_2}/k_{q_2})$ for some pair of primes $q_1$
and $q_2$ of k. Let $M_i$ (i = 1, 2) be the field corresponding to P in the Galois
correspondence, so that $P = \operatorname{Gal}(L_{q_i}/M_i)$. The rational prime p has a unique
totally ramified extension in k; thus one of the $q_i$, say $q_1$, does not lie over p.
It follows that the extension $L_{q_1}/M_1$ is tamely ramified. Since $\varepsilon_{p^a}$ is in $M_1$,
we conclude by Albert [1, Theorem 4] that P is abelian. By [2, 4.1] P is either
cyclic or the direct product of two cyclic groups. This completes the proof of
Theorem 1.1.

It follows from Theorem 1.1 that the symmetric group $S_4$ is not
admissible over all number fields. This answers a question raised in Schacher [2].
By Schacher [2, 2.8] $S_3$ is admissible over all number fields, and this shows
that such groups need not be abelian or nilpotent. Of course $A_4$ satisfies the
conclusion of Theorem 1.1; we shall show presently that it is admissible over
all number fields. $A_5$ also satisfies the conclusion of Theorem 1.1; we do not
know if it is admissible over all number fields. By Theorem 1.1 $A_n$ is not
admissible over all number fields for $n \ge 6$ since its 2-Sylow subgroup is
not abelian. We are not even aware of whether $A_6$ and $A_7$ are Q-admissible.

2. Covering a Resolvent Cubic 

Suppose k is a number field and $h(z) = z^3 + az + b \in k[z]$ an irreducible
cubic. Let $\theta_1, \theta_2, \theta_3$ be the roots of h. We set
\[ f(x) = x^4 - (6y^2 + 2a)x^2 + 8(y^3 + ay + b)x - (3y^4 + 6ay^2 + 12by - a^2) \tag{1} \]
where y is a parameter that will be specialized in k. It is easily verified that
the resolvent cubic of f(x) is
\[ g(w) = w^3 + (6y^2 + 2a)w^2 + 4(3y^4 + 6ay^2 + 12by - a^2)w + (8y^6 + 40ay^4 + 160by^3 - 40a^2y^2 - 32aby - 8a^3 - 64b^2). \]
The roots of g(w) are
\[ w_i = 2a - 2y^2 + 4y\theta_i + 4\theta_i^2 \quad (i = 1, 2, 3). \]
Thus $Q(w_i) = Q(\theta_i)$, i = 1, 2, 3. The discriminants of f and g are both equal to
\[ 2^{12}(y^3 + ay + b)^2(-4a^3 - 27b^2), \tag{2} \]
while the discriminant of the field $Q(\theta_i)$ divides $-4a^3 - 27b^2$.
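The stated roots of the resolvent cubic can be checked numerically for sample values. In the sketch below we take a = -3, b = -1, y = 3 (our choice of test values), for which $h(z) = z^3 - 3z - 1$ has the three real roots $2\cos(\pi/9 + 2\pi k/3)$. The constant term of g is written with the coefficient $-40a^2y^2$, which is the value obtained by expanding $4pr - q^2$ for the quartic $x^4 + px^2 + qx + r$ of (1); with that coefficient the residuals $g(w_i)$ vanish to rounding error.

```python
from math import cos, pi

a, b, y = -3, -1, 3  # sample values (our assumption)

# Roots of h(z) = z^3 - 3z - 1 via z = 2 cos(phi), cos(3 phi) = 1/2.
thetas = [2 * cos(pi / 9 + 2 * pi * k / 3) for k in range(3)]

def g(w):
    """Resolvent cubic of f, with constant term 4pr - q^2 expanded."""
    return (w**3 + (6 * y**2 + 2 * a) * w**2
            + 4 * (3 * y**4 + 6 * a * y**2 + 12 * b * y - a**2) * w
            + (8 * y**6 + 40 * a * y**4 + 160 * b * y**3
               - 40 * a**2 * y**2 - 32 * a * b * y - 8 * a**3 - 64 * b**2))

ws = [2 * a - 2 * y**2 + 4 * y * t + 4 * t * t for t in thetas]
residuals = [abs(g(w)) for w in ws]
print(ws)
print(residuals)  # each should be tiny compared with |w|^3
```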

We specialize a bit further. Assume that a, b are algebraic integers and
that $z^3 + az + b$ determines a cubic cyclic extension of k; thus its
discriminant $-4a^3 - 27b^2$ is a square in k. This guarantees by (2) that the
discriminant of f(x) is a square in k, and so the Galois group of f is a subgroup of $A_4$.
In fact f(x) has Galois group $A_4$ if and only if it is irreducible. We aim for
more; we shall show in fact that y can be chosen so that f(x) is irreducible in
$k_{q_1}[x]$ and $k_{q_2}[x]$ for two primes $q_1$ and $q_2$ of k. What follows is a description
of how these primes are chosen.

Let q be a prime of k at which $z^3 + az + b$ is split completely; there are
infinitely many such q available by the Tschebotarev density theorem. Then
there is an algebraic integer $y_0 \in k$ so that $y_0^3 + ay_0 + b \equiv 0 \pmod{q}$. If
$y_0^3 + ay_0 + b \not\equiv 0 \pmod{q^2}$, we stop at this point. Otherwise let $\pi$ be an
algebraic integer in k that is a uniformizing parameter at q. Then if
$y_1 = y_0 + \pi$ we have
\[ y_1^3 + ay_1 + b = y_0^3 + ay_0 + b + \pi^3 + 3\pi^2 y_0 + \pi(3y_0^2 + a). \]
Thus $y_1^3 + ay_1 + b \not\equiv 0 \pmod{q^2}$ unless $q \mid 3y_0^2 + a$. But $3z^2 + a$ is the
derivative of $h(z) = z^3 + az + b$, and these polynomials have no common roots
(mod q) except for the finite set of primes q that divide the discriminant of h.
Let then $q_1$ and $q_2$ be primes for which h(z) splits completely and which do
not divide the discriminant of h. We also assume that $q_i \nmid 2$, i = 1, 2. By the
Chinese remainder theorem and the discussion above we can choose an
algebraic integer y of k so that
\[ y^3 + ay + b \equiv 0 \pmod{q_i} \]
but
\[ y^3 + ay + b \not\equiv 0 \pmod{q_i^2}, \quad i = 1, 2. \]

We summarize this construction in: 

Theorem 2.1 Let k be an algebraic number field. Suppose the constants
$a, b, y \in k$ and primes $q_1$, $q_2$ are determined as above. Then for f(x) determined
as in (1) we have:

(a) f(x) is irreducible in $k_{q_1}[x]$ and $k_{q_2}[x]$.

(b) The Galois group of f over k is $A_4$.

Proof If f(x) is irreducible in $k_{q_1}[x]$, then it is irreducible in k[x], so the
order of the Galois group of f is divisible by 4. Since the resolvent cubic field
of f is the field of the irreducible cubic $z^3 + az + b$, the order of this group is
divisible by 3. But the Galois group of f is a subgroup of $A_4$ since the
discriminant of f is a square, and so must be all of $A_4$. Thus (b) will follow
from (a).

Let $q = q_1$ or $q_2$. Clearly f cannot be the product of a linear factor and an
irreducible cubic in $k_q[x]$ since the resolvent cubic of f splits in $k_q[x]$. Thus if f
is reducible over $k_q$, it must be a product of two quadratic factors (not
necessarily irreducible themselves) in $k_q[x]$. In this event there are q-adic
integers r, s, and t with
\[ f(x) = (x^2 + rx + s)(x^2 - rx + t). \tag{3} \]
Identifying coefficients in (3) gives the equations
\[ s + t - r^2 = -(6y^2 + 2a), \qquad r(t - s) = 8(y^3 + ay + b), \qquad st = -(3y^4 + 6ay^2 + 12by - a^2). \tag{4} \]
It is easily verified that $(3y^2 + a)^2 \equiv -(3y^4 + 6ay^2 + 12by - a^2) \pmod{q}$,
and so $f(x) \equiv (x^2 - (3y^2 + a))^2 \pmod{q}$. Thus in (4) we have
\[ s \equiv t \equiv -(3y^2 + a) \pmod{q} \quad \text{and} \quad r \equiv 0 \pmod{q}. \]
Then $r(s - t) \equiv 0 \pmod{q^2}$, but $8(y^3 + ay + b) \not\equiv 0 \pmod{q^2}$ by
construction. This contradiction establishes the irreducibility of f(x) in $k_q[x]$.
We have the:

Corollary 2.2 $A_4$ is admissible over all algebraic number fields.

Proof Let k be a number field and f(x) the quartic determined in
Theorem 2.1. Let L be the splitting field of f(x) over k. We wish to show L is
k-adequate. By Schacher [2, 2.1] it is enough to show that the abelian group
$H^2(A_4, L^*)$ has an element of order 12, where $L^* = L - \{0\}$. Since the
3-Sylow subgroup of $A_4$ is cyclic, $H^2(A_4, L^*)$ has an element of order 3 by
Schacher [2, p. 455]. Thus it is enough to show that $H^2(A_4, L^*)$ has an
element of order 4. Let D be the k-division ring with Hasse invariants
determined by
\[ \operatorname{inv}_{q_1}(D) = \tfrac{1}{4}, \qquad \operatorname{inv}_{q_2}(D) = -\tfrac{1}{4}, \qquad \operatorname{inv}_q(D) = 0, \quad q \ne q_1 \text{ or } q_2. \]
Since f(x) is irreducible at $q_1$ and $q_2$, $4 \mid [L_{q_i} : k_{q_i}]$ (i = 1, 2) and so L splits D.
This means that D determines an element of $H^2(A_4, L^*)$ of order 4, and L is
k-adequate. But $\operatorname{Gal}(L/k) = A_4$, and so $A_4$ is k-admissible.

We close with an explicit example over the rational field Q. The
irreducible cubic $Z^3 - 3Z - 1$ of discriminant $3^4$ determines a cyclic cubic field
over Q, so we choose a = -3, b = -1. If y = 3, then $y^3 + ay + b = 17$.
Then the polynomial f(x) becomes
\[ f(x) = x^4 - 48x^2 + 136x - 36 \]
with discriminant $2^{12} \cdot 3^4 \cdot 17^2$. An examination of the Newton polygon shows
that f is irreducible in $Q_2[x]$, and the proof of Theorem 2.1 shows f
irreducible in $Q_{17}[x]$. Let L be the root field of f over Q. Then $\operatorname{Gal}(L/Q) = A_4$ and
L is Q-adequate.
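The arithmetic in this example can be reproduced directly. The sketch below rebuilds the coefficients of f from (a, b, y) according to (1), compares the discriminant of the resulting depressed quartic (computed from the standard closed formula for $x^4 + cx^2 + dx + e$) with the expression (2), and confirms the congruence $f(x) \equiv (x^2 - (3y^2 + a))^2$ modulo 17 used in the proof of Theorem 2.1. Function names are ours.

```python
def quartic_from(a, b, y):
    """Coefficients (c, d, e) of f(x) = x^4 + c x^2 + d x + e as in (1)."""
    return (-(6 * y * y + 2 * a),
            8 * (y**3 + a * y + b),
            -(3 * y**4 + 6 * a * y * y + 12 * b * y - a * a))

def disc_depressed_quartic(c, d, e):
    """Discriminant of x^4 + c x^2 + d x + e."""
    return (16 * c**4 * e - 4 * c**3 * d**2 - 128 * c**2 * e**2
            + 144 * c * d**2 * e - 27 * d**4 + 256 * e**3)

def disc_formula(a, b, y):
    """The expression (2)."""
    return 2**12 * (y**3 + a * y + b) ** 2 * (-4 * a**3 - 27 * b**2)

c, d, e = quartic_from(-3, -1, 3)
print((c, d, e))                         # coefficients of x^2, x, 1
print(disc_depressed_quartic(c, d, e))   # compare with 2^12 * 3^4 * 17^2
print(disc_formula(-3, -1, 3))

# Reduction modulo q = 17: f = (x^2 - m)^2 with m = 3y^2 + a.
m = 3 * 3**2 + (-3)
print(c % 17 == (-2 * m) % 17, d % 17 == 0, e % 17 == (m * m) % 17)
```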

We note that all of our remarks apply for global fields of characteristic
not 2 or 3, and we get:

Corollary 2.3 $A_4$ is k-admissible for all global fields k of characteristic
$p \ne 2$ or 3.
A typical question in extremal number theory is one which asks how large a subset R may be
selected from a given set of integers so that R possesses some desired property. For example, it is
not difficult to see that if R is a subset of the integers {1, 2, ..., 2N} and R has more than N elements
then there are integers x and y in R so that x + y is also in R. The sets {1, 3, 5, ..., 2N - 1} or
{N + 1, N + 2, ..., 2N} show that this bound cannot be improved.

In this note we prove several general results of this type. In particular, we show that if $R \subseteq \{1,
2, \ldots, N\}$ and R has more than N - [N/n] elements, then for some integers x and y, the integers x,
x + y, x + 2y, ..., x + (n - 1)y and y all belong to R. Furthermore the bound N - [N/n] is best
possible.
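For small N the bound N - [N/n] can be confirmed by exhaustive search. The sketch below is a brute-force check, not part of the argument of the paper; the function names are ours, x and y range over positive integers and are allowed to coincide, and the search is exponential in N, so it is usable only for very small cases.

```python
from itertools import combinations

def is_hit(R, n):
    """True if some x, y give x, x+y, ..., x+(n-1)y and y all in R."""
    S = set(R)
    return any(all(x + k * y in S for k in range(n)) for y in S for x in S)

def max_free(N, n):
    """Largest size of a subset of {1, ..., N} that is not hit."""
    for size in range(N, -1, -1):
        if any(not is_hit(R, n)
               for R in combinations(range(1, N + 1), size)):
            return size

# n = 2 is the sum-free case from the first paragraph.
print([max_free(N, 2) for N in range(1, 9)])
print([N - N // 2 for N in range(1, 9)])
# n = 3: three-term progressions together with their common difference.
print(max_free(6, 3), 6 - 6 // 3)
```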

† The work done by this author was done while he was a consultant at Bell Laboratories.
1. Introduction

Suppose $\mathcal{L} = \{L_i(x_1, \ldots, x_m) = \sum_{j=1}^{m} a_{ij}x_j : 1 \le i \le n\}$ is a set of linear
forms in the variables $x_j$ with integer coefficients $a_{ij}$. The question we
consider is the following:

How large may a subset R of {1, 2, ..., N} be so that for every choice of
positive integers $t_j$, $1 \le j \le m$, at least one of the values $L_i(t_1, \ldots, t_m)$,
$1 \le i \le n$, is not in R?

Unfortunately, this question appears to be rather difficult and very few 
general results are currently available. In this paper we study this problem 
for several important special sets if. It will be seen that even in these simple 
cases, the problem is not without interest. 


2. Preliminaries 


Let $[1, N]$ denote the set $\{1, 2, \ldots, N\}$. If $\mathcal{L} = \{L_i(x_1, \ldots, x_m) : 1 \le i \le n\}$
is a set of linear forms, we say that a set $R \subseteq [1, N]$ is $\mathcal{L}$-free if for any choice
of positive integers $t_1, \ldots, t_m$, at least one of the values $L_i(t_1, \ldots, t_m)$ does not
belong to $R$. If $R$ is not $\mathcal{L}$-free, we say that $\mathcal{L}$ hits $R$. Define

$S_{\mathcal{L}}(N) = \max_R |R|$

where the max is taken over all $R \subseteq [1, N]$ that are $\mathcal{L}$-free and $|R|$ denotes
the cardinality of $R$. Also, define $\delta(\mathcal{L})$, called the critical density of $\mathcal{L}$, by

$\delta(\mathcal{L}) = \liminf_{N} S_{\mathcal{L}}(N)/N.$

As an example, consider the system $\mathcal{L}_n = \{x_1 + kx_2 : 0 \le k < n\}$. The
condition that $R$ is $\mathcal{L}_n$-free means exactly that $R$ contains no arithmetic
progression of $n$ terms.

For this example, a recent result of Szemerédi [2] asserts that
any infinite set of integers of positive upper density contains arbitrarily long
arithmetic progressions. From this it follows at once that $\delta(\mathcal{L}_n) = 0$.


3. Augmented Arithmetic Progressions 

We now consider a system closely related to $\mathcal{L}_n$ which we denote by $\mathcal{L}_n^*$.
It is defined by

$\mathcal{L}_n^* = \{x_1 + kx_2 : 0 \le k < n\} \cup \{x_2\}.$

In this case, $\mathcal{L}_n^*$ hits $R$ if and only if $R$ contains an arithmetic progression of
$n$ terms together with the common difference of the progression. However,
the critical density of $\mathcal{L}_n^*$ differs sharply from that of $\mathcal{L}_n$ as the following
examples indicate.

Example 1 Let $R_1 \subseteq [1, N]$ be defined by

$R_1 = \{x \in [1, N] : x > [N/n]\}.$

Clearly $R_1$ is $\mathcal{L}_n^*$-free since

$t_1 + (n - 1)t_2 \ge n(1 + [N/n]) > N$ for $t_1, t_2 \in R_1$.

Thus

$S_{\mathcal{L}_n^*}(N) \ge |R_1| = N - [N/n].$    (1)

Example 2 Suppose $n$ is prime and let $R_2 \subseteq [1, N]$ be defined by

$R_2 = \{x \in [1, N] : x \not\equiv 0 \pmod{n}\}.$

Then $\mathcal{L}_n^*$ cannot hit $R_2$ since for any integers $t_1$ and $t_2$, either
$t_2 \equiv 0 \pmod{n}$ or $t_1 + kt_2$, $0 \le k < n$, runs through a complete residue
system modulo $n$ and therefore represents $0 \notin R_2$. Note that

$|R_2| = N - [N/n] = |R_1|.$    (2)

The following result shows that equality holds in (1) and, in fact, (2) is best
possible.

Theorem 1 Suppose $R \subseteq [1, N]$ with $|R| > N - [N/n]$. Then $\mathcal{L}_n^*$ hits
$R$.

Proof Let $R$ satisfy the hypothesis of the theorem and suppose $R$ is
$\mathcal{L}_n^*$-free. Let $\Delta$ denote the least element of $R$. Then we may assume

$\Delta \le [N/n]$    (3)

since otherwise $|R| \le N - [N/n]$. Define the arithmetic progressions
$T_i \subseteq [1, N]$ by

$T_i = \{i + k\Delta : 0 \le k < n\}$, $1 \le i \le N - (n - 1)\Delta$.

Also, define $A_j, A'_j \subseteq [1, N]$ for $1 \le j \le n$ as follows:

$A_j = [(j - 1)\Delta + 1, j\Delta]$ for $1 \le j < n$, $A_n = [(n - 1)\Delta + 1, N]$;

$A'_j = [N - j\Delta + 1, N - (j - 1)\Delta]$ for $1 \le j < n$, $A'_n = [1, N - (n - 1)\Delta]$.

By (3), we see that

$|A_n| = |A'_n| \ge \Delta.$

Also, it is easily checked that if $x \in A_j \cap A'_{j'}$ then $j + j' = n + t$ for some $t$,
$1 \le t \le n$, and

$|\{i : x \in T_i\}| = t.$    (4)

We claim the following equation holds:

$n|R| = \sum_i |T_i \cap R| + \sum_{j=1}^{n-1} (n - j)(|A_j \cap R| + |A'_j \cap R|).$    (5)

To prove (5), let $x \in R$. Then for some $k$ and $k'$, $x \in A_k \cap A'_{k'}$. Since the $A_j$
are disjoint, as are the $A'_j$, then the contribution $x$ makes to the second sum
on the right-hand side of (5) is just $(n - k) + (n - k')$. Let $k + k' = n + t$.
Hence, by (4), $x$ contributes exactly $t$ to the first sum in (5). Therefore, each
$x \in R$ contributes exactly

$(n - k) + (n - k') + (k + k' - n) = n$

to the right-hand side of (5) so that Eq. (5) is indeed valid. But by hypothesis,
since $\Delta \in R$, then $|T_i \cap R| \le n - 1$ for all $i$. Thus, since $|A_1 \cap R| = 1$,
then by (5)

$n|R| \le (n - 1)(N - (n - 1)\Delta) + 2\Delta \sum_{j=1}^{n-1} (n - j) - (n - 1)(\Delta - 1)$
$= (n - 1)N + \Delta(-(n - 1)^2 + n(n - 1) - (n - 1)) + n - 1$
$= (n - 1)(N + 1),$    (6)

which implies

$|R| \le \left[\frac{(n - 1)(N + 1)}{n}\right] = N - \left[\frac{N}{n}\right].$    (7)

This proves Theorem 1. ∎

Of course, it follows from (1) and (7) that

$S_{\mathcal{L}_n^*}(N) = N - [N/n]$    (8)

and consequently

$\delta(\mathcal{L}_n^*) = 1 - n^{-1}.$
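
The extremal value $N - [N/n]$ can be confirmed by brute force for small parameters. The sketch below is an illustration only; the function names `hits` and `max_free_size` are assumptions, and the search is exponential in $N$, so it is meant for very small cases.

```python
from itertools import combinations

def hits(subset, n):
    """True if subset contains x, x+y, ..., x+(n-1)y together with y."""
    s = set(subset)
    top = max(s)
    for y in s:
        for x in s:
            if x + (n - 1) * y > top:
                continue  # the progression would leave the set
            if all(x + k * y in s for k in range(n)):
                return True
    return False

def max_free_size(limit, n):
    """Largest size of a subset of [1, limit] that is not hit."""
    universe = range(1, limit + 1)
    for size in range(limit, 0, -1):
        if any(not hits(c, n) for c in combinations(universe, size)):
            return size
    return 0

for n in (2, 3):
    for limit in range(1, 9):
        print(n, limit, max_free_size(limit, n), limit - limit // n)
```

The last two columns of each printed line should agree, in line with Theorem 1 and Example 1.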


4. Forms in One Variable—A Special Case 

As a prelude to a discussion in the next section of the general case of
linear forms in one variable (i.e., with $m = 1$), we consider first the special
case $\mathcal{L} = \{x, 2x, 3x\}$. This example in fact has all the essential features of the
general case.

To begin, we let $D = \{d_1 < d_2 < \cdots\}$ denote the set of all integers of the
form $2^a 3^b$, $a, b \ge 0$.

Let $N$ be a fixed positive integer. For $1 \le t \le N$ with $(t, 6) = 1$, let $C(t)$
denote the set

$C(t) = [1, N] \cap \{td_k : k = 1, 2, \ldots\}.$

Note that a set $R \subseteq [1, N]$ is $\mathcal{L}$-free if and only if $R(t) = R \cap C(t)$ is $\mathcal{L}$-free
for all $t$ with $(t, 6) = 1$. For indeed, $\mathcal{L}$ can hit $R$ only if for some $x$,
$\{x, 2x, 3x\} \subseteq R$. However, this implies that $\mathcal{L}$ hits $R(t)$ for some $t$ relatively
prime to 6. Thus, a maximal $\mathcal{L}$-free set $R$ is formed by taking the union of
maximal $\mathcal{L}$-free subsets from $C(t)$ for each $t$, $(t, 6) = 1$. However, it is clear
that

$X_t = \{td_k : k = 1, \ldots, r\} \subseteq C(t)$

is $\mathcal{L}$-free if and only if $X_1 = \{d_k : k = 1, \ldots, r\} \subseteq C(1)$ is $\mathcal{L}$-free. Thus, if $f(r)$
denotes the cardinality of the largest $\mathcal{L}$-free subset of $\{d_1, \ldots, d_r\}$ and $h(r)$
denotes the number of $t \in [1, N]$, $(t, 6) = 1$, with $|C(t)| = r$, then for any
$\mathcal{L}$-free set $R \subseteq [1, N]$,

$|R| \le \sum_{r=1}^{\infty} f(r)h(r).$    (9)

For fixed $r$, $|C(t)| = r$ if and only if

$td_r \le N < td_{r+1},$

i.e.,

$N/d_{r+1} < t \le N/d_r.$

Thus,

$\frac{h(r)}{N} \to \frac{1}{3}\left(\frac{1}{d_r} - \frac{1}{d_{r+1}}\right)$ as $N \to \infty$    (10)

and, therefore, for maximal $\mathcal{L}$-free sets $R_N \subseteq [1, N]$,

$\lim_{N \to \infty} \frac{|R_N|}{N} = \frac{1}{3} \sum_{r=1}^{\infty} f(r)\left(\frac{1}{d_r} - \frac{1}{d_{r+1}}\right).$    (11)

But

$f(r + 1) - f(r) \le 1,$

so that letting $K(\mathcal{L})$ denote the set $\{k : f(k) > f(k - 1)\}$, the telescoping sum
in (11) becomes

$\delta(\mathcal{L}) = \frac{1}{3} \sum_{k \in K(\mathcal{L})} \frac{1}{d_k}.$    (12)

Unfortunately, there does not seem to be any simple way to determine the
elements of $K(\mathcal{L})$. The first few values are given in Table 1.


TABLE 1

   k   f(k)      k   f(k)      k   f(k)
   1     1      13     9      25    17
   2     2      14    10      26    18
   3     2      15    11      27    18
   4     3      16    11      28    19
   5     4      17    12      29    20
   6     5      18    13      30    20
   7     5      19    13      31    21
   8     6      20    14      32    22
   9     7      21    14      33    22
  10     7      22    15      34    23
  11     8      23    16      35    24
  12     8      24    17      36    25

Thus,

$K(\mathcal{L}) = \{1, 2, 4, 5, 6, 8, 9, 11, 13, 14, 15, 17, 18, 20, 22, 23, 24, 26, 28, 29, 31, 32, 34, 35, 36, \ldots\}.$    (13)

It may be that $f(k) = 1 + [2k/3]$ if $k \not\equiv 0 \pmod{3}$ and, perhaps, for all $k$,
there is always a maximal $\mathcal{L}$-free set

$R_k = \{2^{a_i} 3^{b_i} : i = 1, \ldots, f(k)\} \subseteq \{d_1, \ldots, d_k\}$

in which all $a_i - b_i$ are congruent modulo 3.

It would also be interesting to know if $\delta(\mathcal{L})$ is irrational.
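
The first entries of the table of $f(k)$ and of the set $K(\mathcal{L})$ can be recomputed by exhaustive search. The sketch below is an illustration under stated assumptions: the helper names are hypothetical, and $f(k)$ is found by looking for a smallest set of deletions that meets every triple $\{x, 2x, 3x\}$, which is practical only for small $k$.

```python
from itertools import combinations

def smooth_numbers(count):
    """First `count` integers of the form 2^a * 3^b, in increasing order."""
    found, candidate = [], 1
    while len(found) < count:
        m = candidate
        while m % 2 == 0:
            m //= 2
        while m % 3 == 0:
            m //= 3
        if m == 1:
            found.append(candidate)
        candidate += 1
    return found

def f(k, d):
    """Size of a largest subset of d[:k] containing no triple {x, 2x, 3x}."""
    elems = d[:k]
    present = set(elems)
    triples = [{x, 2 * x, 3 * x} for x in elems
               if 2 * x in present and 3 * x in present]
    for removed in range(k + 1):
        for gone in combinations(elems, removed):
            g = set(gone)
            if all(g & t for t in triples):   # every triple loses an element
                return k - removed

d = smooth_numbers(12)
values = [f(k, d) for k in range(1, 13)]
jumps = [k for k in range(1, 13) if values[k - 1] > ([0] + values)[k - 1]]
print(values)   # f(1), ..., f(12)
print(jumps)    # the elements of K up to 12
```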


5. Forms in One Variable—The General Case 

Let $\mathcal{L}$ denote the set of linear forms $\{a_1 x, \ldots, a_n x\}$ where
$A = \{a_1 < \cdots < a_n\}$. Let $P(A) = \{q_1, \ldots, q_r\}$ be the set of primes dividing the
$a_i$ and let $D(A) = (d_1 < d_2 < \cdots)$ denote the set of all integers of the form
$q_1^{\alpha_1} \cdots q_r^{\alpha_r}$, $\alpha_i \ge 0$. For each $k$ let $f(k)$ denote the cardinality of a maximal
$\mathcal{L}$-free subset of $\{d_1, \ldots, d_k\}$. Finally, let $K(\mathcal{L})$ be defined by

$K(\mathcal{L}) = \{k : f(k) > f(k - 1)\}.$

By using essentially the same arguments as in the previous section, the
following theorem can be proved.

Theorem 2

$\delta(\mathcal{L}) = \prod_{j=1}^{r} (1 - q_j^{-1}) \sum_{k \in K(\mathcal{L})} d_k^{-1}.$    (14)


6. Concluding Remarks 

One problem with a representation such as (14) is that it is not clear how
to describe $K(\mathcal{L})$ so as to be able to evaluate $\sum d_k^{-1}$. Several systems
$\mathcal{L} = \mathcal{L}(a_1, \ldots, a_n) = \{a_1 x, \ldots, a_n x\}$ of forms in one variable are known,
however, for which such a description can be given. We list a sample of these
below. The arguments needed to determine the sets $K(\mathcal{L})$ are not difficult
and are omitted.

1. $\delta(\mathcal{L}(1, p, p^2, \ldots, p^{m-1})) = (p^m - p)/(p^m - 1)$ for $p$ prime. Thus,
$\delta(\mathcal{L}(1, 2)) = 2/3$ as expected.

2. $\delta(\mathcal{L}(1, n)) = n/(n + 1)$.

3. <5(if(2, 3)) = f. 

4. <5(if(l, 2, 8)) = f|. Some recent results of Harlambis [1] are relevant 
here. 

It seems quite likely that almost all systems $\mathcal{L}$ have $\delta(\mathcal{L})$ irrational
although not even one such $\mathcal{L}$ is known at present!
... brings to mind the following empirical theorem:

n being an integer, let $n_1$ be the sum of the divisors of $n$ less than $n$, let $n_2$ be the sum
of the divisors of $n_1$ less than $n_1$; etc. This granted: the numbers $n, n_1, n_2, \ldots$ tend to a limit
$\lambda$, which is 1 or a perfect number.

E. Catalan [5]

This is the origin of the Catalan-Dickson conjecture. Dickson [9] 
pointed out that the sequence will sometimes cycle between the members of 
an amicable pair, or round some larger cycle, though at the time no such 
cycle was known. 

The period may have more than two terms, which one could call, to keep the same
terminology, sociable numbers. For example the number 12496 generates a period of 4 [sic]
terms, the number 14316 a period of 28 terms.

Finally, in certain cases, one arrives at very large numbers which make the calculation unbearable.
Example: the number 138.

P. Poulet [21]

Poulet’s calculations led him to believe the contrary of the Catalan-Dickson
conjecture, namely that there are sequences that increase
indefinitely, albeit somewhat erratically. Selfridge and the present writer,
with considerable computational help from others [1, 2, 8, 11, 16, 17, 19, 20],
have offered both experimental and heuristic evidence [13-15, 23] for the
almost diametrically opposite conjecture that almost all even aliquot sequences
are unbounded.

Let $s(n)$ be the sum of the aliquot parts of $n$ (divisors of $n$ other than $n$
itself), so that $s(n) = \sigma(n) - n$, where $\sigma(n)$ is the usual sum of divisors function.
Define $s^0(n) = n$, and, for $k \ge 1$,

$s^k(n) = s(s^{k-1}(n)),$

(here often written $n : k$). D. H. Lehmer answered Poulet’s question about
138 by showing that the sequence has a maximum

179931 895322 = 138 : 117 = $2 \cdot 61 \cdot 929 \cdot 1587569$,

and that 138 : 177 = 1. The next number to give difficulty is 276. Computations
were made by Paxson [19, 20], by Cohen [6], and, much more extensively,
by Lehmer, and our present state of knowledge of the 276 sequence is

107100 047962 427456 048833 497403 019424 = 276 : 433 = $2^5 \cdot 3 \cdot 199 \cdot c$,

where $c$ is a 31-digit composite number with no small factors. The desire to
calculate such sequences has been one of a number of stimuli leading to the
development of a variety of new algorithms for factoring large numbers [12].

Another question asked by Poulet concerned the existence of cycles of
periods other than 1, 2, 5, and 28. In particular it is still not known if period 3
can occur. Comparatively recently 14 cycles of period 4 have been found by
Borho [4], Cohen [6], David [7] and Root [3], namely those for

n= 1264460, 2115324, 2784580, 4938136, 7169104, 

18048976, 18656380, 28158165, 46722700, 81128632, 

174277820, 209524210, 330003580, 498215416. 

An interesting example of a periodic sequence is 17490, for which 
17490 : 228 = 1264460, the smallest member of the smallest of these 4-cycles. 
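
The iteration $n : k$ is easy to experiment with for small starting values. The following sketch is illustrative only: the names `s` and `aliquot` and the `max_steps` cutoff are assumptions, and trial division is far too slow for the large terms discussed in this paper.

```python
def s(n):
    """Sum of the aliquot parts of n, i.e. sigma(n) - n, by trial division."""
    if n < 2:
        return 0
    sigma, m, p = 1, n, 2
    while p * p <= m:
        if m % p == 0:
            term = 1
            while m % p == 0:
                m //= p
                term = term * p + 1   # builds 1 + p + ... + p^a
            sigma *= term
        p += 1 if p == 2 else 2
    if m > 1:
        sigma *= m + 1                # one remaining prime factor
    return sigma - n

def aliquot(n, max_steps=100):
    """Follow n : 0, n : 1, ... and report how the computed part ends."""
    terms = [n]
    while len(terms) <= max_steps:
        nxt = s(terms[-1])
        if nxt in terms:
            return terms, "periodic"
        terms.append(nxt)
        if nxt == 1:
            return terms, "terminating"
    return terms, "incomplete"

print(aliquot(12))      # a short terminating sequence
print(aliquot(12496))   # the cycle that Poulet's note attributes to 12496
```

Running the second call lists five distinct terms before 12496 recurs, which is why the "4" in the quotation above carries a [sic].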

So aliquot sequences may be classified as terminating if a value of $k$ is
known such that $n : k = 1$, or periodic if numbers $c \ge 1$ and $k$ are known
such that $n : k = n : c + k$, or incomplete otherwise. Since knowledge is
steadily advancing, membership of these classes varies with time. A good
deal of calculation has been done at levels greater than $10^{24}$: for example,
the Lehmers have investigated the following sequences, in addition to that
for 276:

35149 477396 986268 016618 686344 127020 = 552 : 181 = $2^2 \cdot 3^2 \cdot 5 \cdot 7^2 \cdot c$,

2 422499 075303 417661 059252 663526 = 564 : 265 = $2 \cdot 3^2 \cdot 23 \cdot 89 \cdot c$,

166 036689 342670 728789 598390 618080 = 660 : 168 = $2^5 \cdot 3 \cdot 5 \cdot 7 \cdot 11 \cdot 37 \cdot c$,

8 636062 816231 780695 927699 123810 = 840 : 195 = $2 \cdot 3^2 \cdot 5 \cdot 7 \cdot 53 \cdot c$,

34 177685 625161 284824 431442 117328 = 966 : 184 = $2^4 \cdot 7^3 \cdot 139 \cdot c$;

H. J. Godwin (written communication) has calculated

8502 332502 428553 163137 122717 455339 104096 = 1074 : 849 = $2^5 \cdot 3 \cdot 31 \cdot 457 \cdot c$,

(assuming that a 38-digit pseudoprime in term 847 is prime) and

60012 099828 734883 991294 272626 672754 = 1464 : 530 = $2 \cdot 3 \cdot 29 \cdot x$,

where $x$ was still to be tested; and Selfridge and Wunderlich (see [14, 17])
have pursued all sequences starting below $10^4$ to beyond $10^{24}$. However,
only one sequence, 4488, has exceeded this level:

1 807814 653481 873196 164108 = 4488 : 459 = $2^2 \cdot 13 \cdot 991 \cdot 146819 \cdot 3201581 \cdot 74632871$,

and later been found to terminate, 4488 : 801 = 1.

Devitt [8] has pursued all even numbers in the ranges ($10^k + 2$, $10^k + 1000$),
$k$ = 9, 10, 11 and 12 (see also [16]) and found the following numbers
of the three classes of sequences:

k =                          9     10     11     12
terminating                161    146    139    113
periodic                     7      4      4      4
incomplete at $10^{18}$    332    350    357    383.


The steady increase of the numbers of incomplete sequences is, of course,
partly accounted for by the fact that the later samples start somewhat nearer
the arbitrary bound, $10^{18}$, but this effect is only marginal, and if we calculate
the average order of $s(n)$ for $n$ even (e.g., by the method given in Hardy and
Wright [18, Theorem 324]) we obtain

$n(5\pi^2/24 - 1) \approx 1.05617n > n,$

so that, on average, such sequences can be expected to increase. In [15]
Selfridge and the present writer give detailed consideration to the various
guides and drivers (see below) which control aliquot sequences. Similar ideas
are outlined by Wunderlich [23], and by Devitt [8, Chapter 4].

H. J. J. te Riele [22] has constructed an aliquot sequence containing 5092
(and almost certainly many more) monotonically increasing terms. A most
interesting remark in te Riele’s paper is the last sentence stating that H. W.
Lenstra can prove the existence of arbitrarily long monotonically increasing
aliquot sequences. Versions of Lenstra’s proof are given by Erdős [10] and
Devitt [8, pp. 42-44]. Erdős also proves the stronger result:

Theorem For all n except a sequence of density 0, and for every k and 

d>0, 

for 

Here we use Lenstra’s method to establish the following remarkable
result:

Theorem Given any prime $p_k$, any integer $t$, and any real number $\rho > 1$,
there are aliquot sequences containing $t$ consecutive terms, each greater than $\rho$
times the previous one, but whose only prime divisors exceed $p_k$.

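Proof We construct such a sequence,

$n = n : 0, \; n : 1, \; n : 2, \; \ldots, \; n : t,$

where, for $0 \le r \le t$,

$n : r = p_{k+1}^{a_1} p_{k+2}^{a_2} \cdots p_{k+l+t-r}^{a_{l+t-r}} m_r,$

and none of the first $k + l + t - r$ primes divide $m_r$.

First choose $l$ so that the sequence diverges with the required rapidity. For
$0 \le r \le t - 1$,

$\frac{n : r + 1}{n : r} = \frac{s(n : r)}{n : r} = \frac{\sigma(n : r)}{n : r} - 1$

$= \frac{\sigma(p_{k+1}^{a_1} p_{k+2}^{a_2} \cdots p_{k+l+t-r}^{a_{l+t-r}} m_r)}{p_{k+1}^{a_1} p_{k+2}^{a_2} \cdots p_{k+l+t-r}^{a_{l+t-r}} m_r} - 1$

$= \frac{\sigma(p_{k+1}^{a_1})}{p_{k+1}^{a_1}} \cdot \frac{\sigma(p_{k+2}^{a_2})}{p_{k+2}^{a_2}} \cdots \frac{\sigma(m_r)}{m_r} - 1$

$\ge \frac{\sigma(p_{k+1}^{a_1}) \sigma(p_{k+2}^{a_2}) \cdots \sigma(p_{k+l}^{a_l})}{p_{k+1}^{a_1} p_{k+2}^{a_2} \cdots p_{k+l}^{a_l}} - 1$

$\ge \frac{p_{k+1} + 1}{p_{k+1}} \cdot \frac{p_{k+2} + 1}{p_{k+2}} \cdots \frac{p_{k+l} + 1}{p_{k+l}} - 1,$

and since $\prod (1 + p^{-1})$ diverges, we may choose $l$ (e.g., $l = [k^{\rho+1}]$) to make
the ratio greater than $\rho$.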

Next choose a t to exclude the first k primes as factors of the n: r, e.g., 
a k = 4>(p\p\ ■■pi) - 1. By the Euler-Fermat theorem, since (p„ p k+l ) = 1 
for 1 < i < k, 

PF+i = 1 mod 

so p£VV = 1 mod Pi, and Pi\<j(p a k ' +l ), p,|tr(n:r), and since p ; |n: r. 
Pi \ s(n : r) - n : r + 1 for 0 ^ r ^ t - 1 and each i, 1 ^ i ^ k. 

Finally, choose the remaining a j+ i9 1 ^ j < / + t - 1, recursively, so as 
to ensure that 

PV+j || n : r for — r. 

E.g., take a j+l = — 1, so that, as before, 

PVijti = 1 mod pVtjK 

so p2 j +/I ct (p* j +7+i)> Pk+J l \°(n ■ r), and since p a k ‘ +j || n : r, we have 
P? J +j || s(n : r) = n : r + 1 , 0 < r < t - 1 . | 

As can be seen from the form of the proof, the powers of the primes used
are subject to eventual erosion. Dickson [9] looked for combinations of
powers of small primes that would “lock in,” but any such search is doomed
to failure; in fact Selfridge and the present author [15] give the following
result.

Theorem For any $d$, there is some prime divisor of $d$ that does not divide
$\sigma(d)/d$.

On the other hand, some combinations do have remarkably long “lives,”
and these are important in making quantitative estimates. We restrict our
attention to the factors of $\sigma(2^a) = 2^{a+1} - 1$. Define a guide $g$ to be $g = 2^a v$
where $v$ divides $\sigma(2^a)$. If also $2^{a-1}$ divides $\sigma(v)$, the guide is called a driver. If
$g = 2^a v$ divides $n$ and $2^a \| n$, $g$ is said to be a guide of (driver of) $n$. Drivers
include the even perfect numbers; there are only finitely many others.

Theorem [15] The only drivers are 2, $2^3 \cdot 3$, $2^3 \cdot 3 \cdot 5$, $2^5 \cdot 3 \cdot 7$, $2^9 \cdot 3 \cdot 11 \cdot 31$
and the even perfect numbers.
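
The definition of a driver can be tested mechanically for small exponents. The sketch below is an illustration, not the proof from [15]: the function names and the cutoff `max_a` are assumptions, and it simply enumerates every $g = 2^a v$ with $v \mid 2^{a+1} - 1$ and $2^{a-1} \mid \sigma(v)$.

```python
from math import isqrt

def sigma(n):
    """Sum of all divisors of n, by trial division."""
    total, m, p = 1, n, 2
    while p * p <= m:
        if m % p == 0:
            term = 1
            while m % p == 0:
                m //= p
                term = term * p + 1
            total *= term
        p += 1
    if m > 1:
        total *= m + 1
    return total

def divisors(n):
    """All positive divisors of n in increasing order."""
    small = [d for d in range(1, isqrt(n) + 1) if n % d == 0]
    return sorted(set(small + [n // d for d in small]))

def drivers(max_a):
    """Every driver 2^a * v with 1 <= a <= max_a."""
    found = []
    for a in range(1, max_a + 1):
        mersenne = 2 ** (a + 1) - 1            # sigma(2^a)
        for v in divisors(mersenne):
            if sigma(v) % 2 ** (a - 1) == 0:
                found.append(2 ** a * v)
    return sorted(found)

print(drivers(12))
```

For $a \le 12$ the output consists of the five exceptional drivers of the theorem together with the even perfect numbers 6, 28, 496, 8128 and 33550336.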

We may calculate the average order of $s(n)/n$ for those $n$ having a given
guide, and this gives an estimate of the amplification of that guide, or expected
ratio of $s(n)/n$ while the guide is in control of the sequence. Table 1 [8,
p. 58] gives calculated amplifications for some of the more commonly occurring
guides, and compares them with the results obtained from studying
more than 186,000 terms, lying between $10^5$ and $10^{18}$, of the 2000 sequences
with $n$ even, $10^k + 2 \le n \le 10^k + 1000$, $9 \le k \le 12$. Less than 15000 terms
had guides with powers of 2 greater than the fourth, or odd guides.


TABLE 1

Guide                 Number of      Calculated                       Observed
                      occurrences    amplification                    amplification
2                     46526          $\pi^2/6 - 1 = 0.64493$          0.62787
$2 \cdot 3$           24323          $\pi^2/4 - 1 = 1.46740$          1.26744
$2^2$                 32573          $3\pi^2/14 - 1 = 1.11492$        1.07573
$2^2 \cdot 7$         27607          $\pi^2/4 - 1 = 1.46740$          1.40354
$2^3$                 10014          $\pi^2/5 - 1 = 0.97392$          0.97143
$2^3 \cdot 3$         6018           $3\pi^2/10 - 1 = 1.96088$        1.69723
$2^3 \cdot 5$         5097           $5\pi^2/18 - 1 = 1.74156$        1.38542
$2^3 \cdot 3 \cdot 5$ 2917           $\pi^2/3 - 1 = 2.28987$          2.17929
$2^4$                 11708          $15\pi^2/62 - 1 = 1.38781$       1.34447
$2^4 \cdot 31$        4919           $\pi^2/4 - 1 = 1.46740$          1.44918


The agreement between theory and practice is as good as one can expect.
“Typical” behavior cannot be expected unless the number of prime factors is
“large”; that is, when $\ln \ln n$ is large, which is when $n$ is well beyond computer
range. The only even guides with amplification less than 1 are 2 and
(but only just!) $2^3$. These two account for 56540 of the terms examined, less
than one-third of the total.
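
The calculated amplifications are simple multiples of $\pi^2$, so the printed decimals can be checked numerically. In the sketch below the coefficients are the ones read from Table 1 (an assumption where the scan is unclear), and the dictionary layout is purely illustrative.

```python
from math import pi

# guide -> (coefficient c in c * pi^2 - 1, value printed in Table 1, occurrences)
table = {
    "2":      (1 / 6,   0.64493, 46526),
    "2.3":    (1 / 4,   1.46740, 24323),
    "2^2":    (3 / 14,  1.11492, 32573),
    "2^2.7":  (1 / 4,   1.46740, 27607),
    "2^3":    (1 / 5,   0.97392, 10014),
    "2^3.3":  (3 / 10,  1.96088, 6018),
    "2^3.5":  (5 / 18,  1.74156, 5097),
    "2^3.3.5": (1 / 3,  2.28987, 2917),
    "2^4":    (15 / 62, 1.38781, 11708),
    "2^4.31": (1 / 4,   1.46740, 4919),
}

for guide, (c, printed, count) in table.items():
    computed = c * pi ** 2 - 1
    print(f"{guide:8s} {computed:.5f} {printed:.5f} {count}")

# terms governed by the two guides whose amplification is below 1
below_one = table["2"][2] + table["2^3"][2]
print(below_one, below_one / sum(row[2] for row in table.values()))
```

The ten tabulated guides cover 171702 terms in all, so the printed share is slightly larger than the share of the full sample of more than 186,000 terms.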

One can also estimate, and observe, the break probability for a guide to 
change to a different guide, and the life of a guide, or expected number of 
terms in a sequence before the guide changes. 

For example, the break probabilities for the 2-driver and for the $2^3$ guide,
among terms of size $n$ are approximately

$9/(5 \ln n)$ and $(\ln \ln n)^2/(3 \ln n)$,

and the corresponding lives are approximately

$5(\ln n)/9$ and $3(\ln n)/(\ln \ln n)^2$.

With the previously calculated amplifications, we can estimate the size of a 
term after an average run under the control of a given guide. For these two 
guides we can expect a term of size n to become of the order 
n 3/4 and „i-(inin»)2/i4 

For the sample of 2000 sequences mentioned above, the observed lives were 

9.33 and 2.7. 
The fact that these are less than the estimates is again due to the fact that the
size of term that can be investigated by computer is far too small to exhibit
typical behavior.

The general behavior of aliquot sequences may be considered as a 
Markov process whose states are determined by the guides that are in 
control. The limit of the powers of the stochastic matrix associated with the 
same 2000 sequences yields the following probabilities of the seven most 
popular guides being in control: 

$2$       $2 \cdot 3$   $2^2$     $2^2 \cdot 7$   $2^3$     $2^4$     $2^4 \cdot 31$

0.2256    0.1025        0.1535    0.1851          0.0482    0.0580    0.0481,

accounting for 0.821 of the total probability. If we use these (and the other) 
probabilities to make a weighted average of the amplifications we obtain the 
figure 

1.05745, 

in very good agreement with the calculated amplification, mentioned before, 
the average order of s(n)/n taken over all even numbers, 

$5\pi^2/24 - 1 \approx 1.05617.$

It is highly unlikely that the Catalan-Dickson conjecture can be refuted, 
but the circumstantial evidence against it seems to have become 
overwhelming. 
1. Introduction

An arithmetical problem of some interest is the study of integral matrices
$A$ for which $AA^T$ is of some special form. The incidence matrix $A$ of a
symmetric block design with parameters $v$, $k$, $\lambda$ is a (0, 1) matrix of order $v$
satisfying $AA^T = (k - \lambda)I + \lambda J = B$, $J$ being the matrix of order $v$ all of
whose entries are 1. The case we are interested in here is that of an integral
matrix $A$ of order $n$ such that $AA^T = mI$. This includes all permutation
matrices when $m = 1$, and the Hadamard matrices $H = [h_{ij}]$ of order $n$ for
which every $h_{ij} = \pm 1$ and $HH^T = nI_n$.

It is easy to find necessary and sufficient conditions that $AA^T = mI_n$ have
an integral solution $A$ of order $n$. These are: (1) if $n$ is odd, $m$ is a square; (2) if
$n \equiv 2 \pmod 4$, $m = a^2 + b^2$; (3) if $n \equiv 0 \pmod 4$, $m$ is any positive integer.
These conditions are established in Section 2, and are necessary for the
existence of a rational $A$ and sufficient for the existence of an integral $A$.

If $X$ is the $r$ by $n$ integral matrix consisting of the first $r$ rows of an $A$ for
which $AA^T = mI$, then necessarily $XX^T = mI_r$. The difficult and challenging
problem is that in which we are given an $r$ by $n$ matrix $X$ with $XX^T = mI_r$ to
find whether or not there is an $A$ with $AA^T = mI$ that has $X$ as its first rows.
This we call the completion problem. In Section 3 using results of [2], we can
show that there is always a rational completion ($m$ and $n$ satisfying the
preceding conditions), that is an $A$ with $X$ as its first rows and its remaining
rows rational.

† This research was supported in part by NSF Grant MPS 72-05035 A02.

In Section 4 it is shown that if there are only one or two rows to be added
to $X$, then there is an integral completion $A$. In Section 5 the case of Hadamard
matrices is examined more carefully and it is shown that if at most four
rows need to be added to an $X$ there is a Hadamard completion. But there
are Hadamard $X$'s that do not have Hadamard completions, though they
may have integral completions, as an example shows.

2. Existence Conditions 

Following the methods of Chowla-Ryser [1] we can easily determine
exact conditions for the existence of an integral square matrix $A$ of order $n$
such that $A^T A = mI$.

Theorem 2.1 There exists an integral square matrix $A$ of order $n$ such
that $A^T A = mI$, $m$ a positive integer, if and only if:

(i) for $n$ odd $m$ is a square, $m = m_1^2$;

(ii) for $n \equiv 2 \pmod 4$ $m$ is a sum of two integral squares, $m = a^2 + b^2$;

(iii) for $n \equiv 0 \pmod 4$ $m$ is any positive integer.

Proof Let $A = [a_{ij}]$, $i, j = 1, \ldots, n$. Let $x_1, \ldots, x_n$ be indeterminates. Let
us write

$L_i = \sum_j a_{ij} x_j$, $i = 1, \ldots, n$.    (2.1)

Then the matrix equation

$A^T A = mI$    (2.2)

is equivalent to the identity

$L_1^2 + \cdots + L_n^2 = m(x_1^2 + \cdots + x_n^2),$    (2.3)

as is well known from the theory of quadratic forms, but a direct check is
easy in this case.

Taking determinants of both sides of (2.2) we have

$(\det A)^2 = m^n$    (2.4)

from which it readily follows that if $n$ is odd, then $m$ must be a square.
Conversely, if $m = m_1^2$, then taking $A = m_1 I$, Eq. (2.2) is satisfied.
From here on let us suppose that $n$ is even. Then from the theorem of
Lagrange† any positive integer $m$ can be written as a sum of four integral
squares

$m = b_1^2 + b_2^2 + b_3^2 + b_4^2$    (2.5)

and we have identically

$(b_1^2 + b_2^2 + b_3^2 + b_4^2)(x_i^2 + x_{i+1}^2 + x_{i+2}^2 + x_{i+3}^2) = y_i^2 + y_{i+1}^2 + y_{i+2}^2 + y_{i+3}^2$    (2.6)

where

$y_i = b_1 x_i - b_2 x_{i+1} - b_3 x_{i+2} - b_4 x_{i+3}$
$y_{i+1} = b_2 x_i + b_1 x_{i+1} - b_4 x_{i+2} + b_3 x_{i+3}$
$y_{i+2} = b_3 x_i + b_4 x_{i+1} + b_1 x_{i+2} - b_2 x_{i+3}$
$y_{i+3} = b_4 x_i - b_3 x_{i+1} + b_2 x_{i+2} + b_1 x_{i+3}.$    (2.7)

We note that if we define a matrix $B$ by

$B = \begin{pmatrix} b_1 & -b_2 & -b_3 & -b_4 \\ b_2 & b_1 & -b_4 & b_3 \\ b_3 & b_4 & b_1 & -b_2 \\ b_4 & -b_3 & b_2 & b_1 \end{pmatrix}$    (2.8)

then by direct calculation

$B^T B = BB^T = (b_1^2 + b_2^2 + b_3^2 + b_4^2)I.$    (2.9)

From this it follows that $\det B = \pm(b_1^2 + b_2^2 + b_3^2 + b_4^2)^2$ and since $b_1^4$ has
coefficient $+1$ in $\det B$ that $\det B = (b_1^2 + b_2^2 + b_3^2 + b_4^2)^2$. Hence from (2.7)
we can express the $x$'s in terms of the $y$'s rationally with denominator $m^2$,
where $m = b_1^2 + b_2^2 + b_3^2 + b_4^2$.

We also note that if $n \equiv 0 \pmod 4$ and if $m = b_1^2 + b_2^2 + b_3^2 + b_4^2$, then
we can take a matrix $A$ as the direct sum of $n/4$ copies of $B$:

$A = \begin{pmatrix} B & 0 & \cdots & 0 \\ 0 & B & \cdots & 0 \\ \vdots & & \ddots & \vdots \\ 0 & 0 & \cdots & B \end{pmatrix}$    (2.10)

and from (2.9) it follows that

$A^T A = AA^T = mI.$    (2.11)

This covers part (iii) of the theorem.

† The arithmetical results used in this section may be found in Hardy and Wright [3].
It remains to consider the cases when $n \equiv 2 \pmod 4$. We can apply (2.6)
to the $x$'s in (2.3) four at a time to obtain the identity

$L_1^2 + \cdots + L_n^2 = y_1^2 + \cdots + y_{n-2}^2 + m(x_{n-1}^2 + x_n^2).$    (2.12)

Here $y_1, \ldots, y_{n-2}$ and $x_{n-1}, x_n$ are independent indeterminates, and $L_1, \ldots,
L_n$ are rational linear forms in these indeterminates. We can determine $y_1$ as
a rational linear combination of $y_2, \ldots, y_{n-2}, x_{n-1}, x_n$ by putting

$L_1 = \pm y_1$    (2.13)

where we take $L_1 = y_1$ if the coefficient of $y_1$ in $L_1$ is not $+1$, and putting
$L_1 = -y_1$ if it is. Using (2.13) to express $y_1$ in terms of $y_2, \ldots, y_{n-2}, x_{n-1}, x_n$
we have from (2.12)

$L_2^2 + \cdots + L_n^2 = y_2^2 + \cdots + y_{n-2}^2 + m(x_{n-1}^2 + x_n^2),$    (2.14)

an identity with $L_2, \ldots, L_n$ rational linear forms in independent indeterminates
$y_2, \ldots, y_{n-2}, x_{n-1}, x_n$. Similarly, we put $L_2 = \pm y_2$ to express $y_2$ as a
rational linear form in $y_3, \ldots, y_{n-2}, x_{n-1}, x_n$. Continuing this process we
finally obtain an identity

$L_{n-1}^2 + L_n^2 = m(x_{n-1}^2 + x_n^2)$    (2.15)

with $L_{n-1}$ and $L_n$ rational linear forms in $x_{n-1}$ and $x_n$. Taking $x_{n-1}$ and $x_n$ as
nonzero multiples of the denominators in $L_{n-1}$ and $L_n$, we can consider
(2.15) as an equation in nonzero integers. From this it follows that $m$ itself is
the sum of two integral squares

$m = a^2 + b^2.$    (2.16)

Conversely, given (2.16) and $n \equiv 2 \pmod 4$ we can define a matrix $C$ of
order 2

$C = \begin{pmatrix} a & b \\ -b & a \end{pmatrix}$    (2.17)

and taking $A$ as the direct sum of $n/2$ copies of $C$ we shall have
$AA^T = A^T A = mI$. This completes the proof of part (ii) of the theorem including
the converse. Thus all parts of the theorem are proved.
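
The sufficiency half of Theorem 2.1 is constructive, and the construction can be sketched in a few lines. The code below is an illustration under stated assumptions: the function names are hypothetical, the squares are found by naive search, and the $2 \times 2$ block uses one possible sign convention for $C$.

```python
from math import isqrt

def four_squares(m):
    """Some (b1, b2, b3, b4) with b1^2 + b2^2 + b3^2 + b4^2 = m."""
    for b1 in range(isqrt(m) + 1):
        for b2 in range(b1, isqrt(m - b1 * b1) + 1):
            for b3 in range(b2, isqrt(m - b1 * b1 - b2 * b2) + 1):
                rest = m - b1 * b1 - b2 * b2 - b3 * b3
                b4 = isqrt(rest)
                if b4 * b4 == rest:
                    return b1, b2, b3, b4

def two_squares(m):
    """Some (a, b) with a^2 + b^2 = m, or None if there is none."""
    for a in range(isqrt(m) + 1):
        b = isqrt(m - a * a)
        if a * a + b * b == m:
            return a, b
    return None

def block_diag(block, copies):
    """Direct sum of `copies` copies of a square block."""
    size = len(block)
    n = size * copies
    out = [[0] * n for _ in range(n)]
    for c in range(copies):
        for i in range(size):
            for j in range(size):
                out[c * size + i][c * size + j] = block[i][j]
    return out

def solve(n, m):
    """An integral n x n matrix A with A A^T = m I, or None if none exists."""
    if n % 2 == 1:
        r = isqrt(m)
        if r * r != m:
            return None
        return [[r if i == j else 0 for j in range(n)] for i in range(n)]
    if n % 4 == 2:
        ab = two_squares(m)
        if ab is None:
            return None
        a, b = ab
        return block_diag([[a, b], [-b, a]], n // 2)
    b1, b2, b3, b4 = four_squares(m)
    big_b = [[b1, -b2, -b3, -b4], [b2, b1, -b4, b3],
             [b3, b4, b1, -b2], [b4, -b3, b2, b1]]
    return block_diag(big_b, n // 4)

def gram(a):
    """The product A A^T."""
    n = len(a)
    return [[sum(a[i][k] * a[j][k] for k in range(n)) for j in range(n)]
            for i in range(n)]

print(solve(4, 7))
print(gram(solve(4, 7)))
print(solve(2, 3), solve(3, 2))   # cases excluded by the theorem
```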


3. Rational Completions and Orthogonal Equivalence 


Let $A$ be a square matrix of order $n$ and let $X$ be the $r$ by $n$ matrix
consisting of the first $r$ rows of $A$ so that

$A = \begin{pmatrix} X \\ W \end{pmatrix}$    (3.1)

where $W$ is the $n - r$ by $n$ matrix consisting of the last $n - r$ rows of $A$. Let us
suppose further that every row of $W$ is orthogonal to every row of $X$. In
matrix form this is the condition

$XW^T = 0.$    (3.2)

Then

$AA^T = \begin{pmatrix} X \\ W \end{pmatrix} \begin{pmatrix} X^T & W^T \end{pmatrix} = \begin{pmatrix} XX^T & XW^T \\ WX^T & WW^T \end{pmatrix} = \begin{pmatrix} D_1 & 0 \\ 0 & D_2 \end{pmatrix}.$    (3.3)

The case in which we are particularly interested is that in which $AA^T = mI_n$.
Here

$XX^T = D_1 = mI_r$, $WW^T = D_2 = mI_{n-r}$.    (3.4)

The completion problem in which we are interested is the following:
Given an $r$ by $n$ integral matrix $X$ such that $XX^T = mI_r$ can we find an $n - r$ by
$n$ matrix $W$ such that $A = \begin{pmatrix} X \\ W \end{pmatrix}$ satisfies $AA^T = mI_n$?

If W is rational, we say that A is a rational completion of X and an 
integral completion if W is integral. 

Providing that the conditions of Theorem 2.1 of the preceding section are
satisfied it is immediate from a result of the author and H. J. Ryser [2] that
any $X$ satisfying $XX^T = mI_r$ has a rational completion. We quote Theorem
2.1 of that paper.

Theorem (Hall-Ryser) Suppose that $AA^T = D_1 \oplus D_2$. Here the matrix
$A$ is of order $n$ and nonsingular. The matrix $D_1$ is of order $r$ and $D_2$ is of order $s$
where $r + s = n$. Let $X$ be an arbitrary $r$ by $n$ matrix such that $XX^T = D_1$.
Then there exists an $n$ by $n$ matrix $Z$ having $X$ as its first $r$ rows such that
$ZZ^T = D_1 \oplus D_2$. This result holds for all fields $F$ of characteristic $\ne 2$.

Theorem 3.1 Suppose that $X$ is an $r$ by $n$ integral matrix satisfying
$XX^T = mI_r$, and that $m$ and $n$ satisfy the conditions of Theorem 2.1. Then
there is an $n$ by $n$ matrix $Z$ having $X$ as its first $r$ rows and having its last $n - r$
rows rational such that $ZZ^T = mI_n$.

This follows directly from the Hall-Ryser theorem, taking $F$ as the rational
field, $D_1 = mI_r$, $D_2 = mI_{n-r}$, and $A$ a solution of $AA^T =
mI_n = D_1 \oplus D_2$ which exists since the conditions of Theorem 2.1 are
satisfied.

The following theorem is an easy generalization of Theorem 2.2 of the
Hall-Ryser paper.

Theorem 3.2 Suppose that $AA^T = D_1 \oplus D_2$ where $A$ is of order $n$ and
nonsingular, and $D_1$ and $D_2$ are of order $r$ and $n - r$ and are nonsingular.
Suppose further that $X$ and $Y$ are $r$ by $n$ matrices such that $XX^T = YY^T = D_1$.
Then there exists an orthogonal matrix $U$ of order $n$ such that $XU = Y$. This
result holds for all fields $F$ of characteristic different from 2.
Proof From the Hall-Ryser theorem there exist $n$ by $n$ matrices $R$ and $S$
such that

$R = \begin{pmatrix} X \\ W_1 \end{pmatrix}$, $S = \begin{pmatrix} Y \\ W_2 \end{pmatrix}$

with $W_1$, $W_2$ $n - r$ by $n$ matrices such that

$RR^T = D_1 \oplus D_2$, $SS^T = D_1 \oplus D_2$.    (3.5)

Putting $U = R^{-1}S$ it follows from (3.5) that $UU^T = I$ or that $U$ is orthogonal.
Then

$\begin{pmatrix} X \\ W_1 \end{pmatrix} U = \begin{pmatrix} Y \\ W_2 \end{pmatrix}$,    (3.6)

$XU = Y$,

which is the conclusion of the theorem.


4. Results on Integral Completions 


Theorem 4.1 Suppose that $m$ and $n$ satisfy the conditions of Theorem 2.1.
Then if $r = n - 1$ or $n - 2$ and $X$ is an integral $r$ by $n$ matrix satisfying
$XX^T = mI_r$ there is an $n$ by $n$ integral matrix $A$ having $X$ as its first $r$ rows
such that $AA^T = mI_n$.

Proof For $r = n - 1$, the result is easy. Let $A$ be a rational completion
having $X$ as its first $n - 1$ rows, whose existence is given by Theorem 3.1.
Then in fact $A$ is an integral matrix.

$A = [a_{ij}]$, $AA^T = mI_n$, $A^T A = mI_n$.    (4.1)

The last follows since $mA^{-1} = A^T$, and yields

$\sum_i a_{ij}^2 = m$, $j = 1, \ldots, n$.    (4.2)

But $a_{1j}, a_{2j}, \ldots, a_{n-1,j}$ are integers as entries in $X$. Hence $a_{nj}^2$ is an integer but
as $a_{nj}$ is rational it follows that $a_{nj}$ is an integer for $j = 1, \ldots, n$. Thus $A$ is an
integral matrix and so is an integral completion of $X$.
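
When only one row is missing, that row is determined up to sign by orthogonality, and it can be computed explicitly. The sketch below is not the argument of the proof: it uses the generalized cross product (signed maximal minors of $X$), whose squared length is $\det(XX^T) = m^{n-1}$, and rescales by $m^{(n-2)/2}$. The function names are hypothetical, and the cofactor determinant is only suitable for small $n$.

```python
from math import isqrt

def det(mat):
    """Determinant by cofactor expansion along the first row (small matrices)."""
    if not mat:
        return 1
    total = 0
    for j, entry in enumerate(mat[0]):
        if entry:
            minor = [row[:j] + row[j + 1:] for row in mat[1:]]
            total += (-1) ** j * entry * det(minor)
    return total

def last_row(x, m):
    """A row w making [X; w] satisfy A A^T = m I, given X X^T = m I_{n-1}."""
    n = len(x[0])
    assert len(x) == n - 1
    cofactors = [(-1) ** j * det([row[:j] + row[j + 1:] for row in x])
                 for j in range(n)]
    scale = isqrt(m ** (n - 2))                 # m^((n-2)/2)
    assert scale * scale == m ** (n - 2)        # holds under Theorem 2.1
    assert all(c % scale == 0 for c in cofactors)
    return [c // scale for c in cofactors]

# first three rows of the matrix B of (2.8) with (b1, b2, b3, b4) = (1, 1, 1, 2), m = 7
x = [[1, -1, -1, -2],
     [1, 1, -2, 1],
     [1, 2, 1, -1]]
w = last_row(x, 7)
a = x + [w]
print(w)
print([[sum(p * q for p, q in zip(r1, r2)) for r2 in a] for r1 in a])
```

The recovered row agrees, up to sign, with the fourth row $(b_4, -b_3, b_2, b_1)$ of $B$.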

Now suppose r = n — 2 and let Z be a rational completion of X as given 
by Theorem 3.1. Then 

ZZ T = ml n , Z T Z = ml n . (4.3) 

Here the first n — 2 rows of Z are integral, and the last two are rational. Let s 
be the smallest positive integer such that sZ = A x is an integral matrix. Then 

A : Aj = AjA — s 2 ml n . (4.4) 
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Here if X = [a 0 ], i = 1 ,n - 2, j = 1,.... n, A x has the shape 


sa u * 

• sa xj ■■ 

sa In 

sa 21 

■ ■ sa 2j ■ ■ 

sa 2n 

sa n~ 21 

" sa n-2j 

* Sa n-2n 

*1 

•• xj ■■ 


yi 

■■ yj 

y n 


where all entries are integral. Clearly if s = 1, then $A_1$ is an integral 
completion of X. We shall show that the least positive s giving a matrix such as 
$A_1$ is s = 1. From $A_1^TA_1 = s^2 mI_n$ we have 

\[ \begin{aligned}
s^2(a_{1j}^2 + \cdots + a_{n-2,j}^2) + x_j^2 + y_j^2 &= s^2 m, & j &= 1, \ldots, n, \\
s^2(a_{1j}a_{1k} + \cdots + a_{n-2,j}a_{n-2,k}) + x_jx_k + y_jy_k &= 0, & j &\ne k.
\end{aligned} \tag{4.6} \]

Let p be a prime dividing s. Then (4.6) gives 

\[ x_j^2 + y_j^2 \equiv 0 \pmod{p^2}, \qquad x_jx_k + y_jy_k \equiv 0 \pmod{p^2}, \qquad j, k = 1, \ldots, n, \ j \ne k. \tag{4.7} \]

Then if p = 2 or $p \equiv 3 \pmod 4$ it follows from (4.7) that 

\[ x_j \equiv y_j \equiv 0 \pmod p, \qquad j = 1, \ldots, n. \tag{4.8} \]

In this case $p^{-1}A_1 = A_2$ is comparable to $A_1$, with $s = ps_1$ replaced by $s_1$. We 
may therefore suppose that $p \mid s$ but that p does not divide every x and y in $A_1$. Choose 
j so that p does not divide both $x_j$ and $y_j$. Then from (4.7) p divides neither $x_j$ 
nor $y_j$: 

\[ x_j^2 + y_j^2 \equiv 0 \pmod{p^2}, \qquad x_j \not\equiv 0 \pmod p, \qquad y_j \not\equiv 0 \pmod p. \tag{4.9} \]
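
The dichotomy between (4.8) and (4.9) can be checked by brute force for small primes. The sketch below is illustrative only; the function names are hypothetical. It lists all residue pairs (x, y) modulo p^2 with x^2 + y^2 = 0 (mod p^2) and tests whether p must divide both.

```python
def solutions_mod_p2(p):
    """All (x, y) mod p^2 with x^2 + y^2 = 0 (mod p^2)."""
    q = p * p
    return [(x, y) for x in range(q) for y in range(q) if (x * x + y * y) % q == 0]

def forces_divisibility(p):
    """True if every solution has p | x and p | y, as in (4.8)."""
    return all(x % p == 0 and y % p == 0 for x, y in solutions_mod_p2(p))

# p = 2 or p = 3 (mod 4): the congruence forces p | x and p | y.
small_inert = [p for p in (2, 3, 7, 11) if forces_divisibility(p)]
# p = 1 (mod 4): solutions with p not dividing x exist, e.g. (3, 4) for p = 5.
small_split = [p for p in (5, 13) if not forces_divisibility(p)]
```

Only the primes p = 1 (mod 4) leave room for the situation (4.9), which is why the rest of the proof works in the Gaussian integers.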

As $p \ne 2$, $p \not\equiv 3 \pmod 4$ it follows that p is a sum of two squares and is not a 
prime in the Gaussian integers: 

\[ p = a^2 + b^2 = (a + bi)(a - bi). \tag{4.10} \]

From (4.9) 

\[ (x_j + y_ji)(x_j - y_ji) \equiv 0 \pmod{(a + bi)^2 (a - bi)^2}. \tag{4.11} \]

Since $x_j, y_j \not\equiv 0 \pmod p$, it follows that $x_j + y_ji$ is not divisible by both of 
a + bi and a - bi and also that $x_j - y_ji$ is not divisible by both of a + bi and 
a - bi. Let us suppose the notation such that $(x_j + y_ji, a - bi) = 1$. Then 
$(a + bi)^2 = a^2 - b^2 + 2abi = d + ei$ divides $x_j + y_ji$, where $d^2 + e^2 = p^2$. 
Then 

\[ x_j + y_ji = (d + ei)(w_j + z_ji) \tag{4.12} \]

with $w_j$ and $z_j$ rational integers. It follows that 

\[ (d - ei)(x_j + y_ji) = (dx_j + ey_j) + (-ex_j + dy_j)i = p^2 w_j + p^2 z_ji. \tag{4.13} \]

Now consider for $k \ne j$ the identity 

\[ (x_jx_k + y_jy_k)^2 + (x_jy_k - x_ky_j)^2 = (x_j^2 + y_j^2)(x_k^2 + y_k^2). \tag{4.14} \]

From (4.7) we have $x_j^2 + y_j^2$, $x_k^2 + y_k^2$, $x_jx_k + y_jy_k$ all multiples of $p^2$ so that 
from (4.14) $x_jy_k - x_ky_j$ is also a multiple of $p^2$. 

Now consider the product 

\[ (x_j + y_ji)(x_k - y_ki) = (x_jx_k + y_jy_k) + (-x_jy_k + x_ky_j)i = p^2 u + p^2 vi. \tag{4.15} \]

Since $(x_j + y_ji, a - bi) = 1$ and $p^2 = (a + bi)^2 (a - bi)^2$ it follows that 
$(a - bi)^2 = d - ei$ divides $x_k - y_ki$. Hence for the conjugates d + ei divides 
$x_k + y_ki$. Thus 

\[ x_k + y_ki = (d + ei)(w_k + z_ki) \tag{4.16} \]

with $w_k$ and $z_k$ rational integers. Then 

\[ (d - ei)(x_k + y_ki) = (dx_k + ey_k) + (-ex_k + dy_k)i = p^2 w_k + p^2 z_ki. \tag{4.17} \]
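Now write $s = ps_1$ and define the matrix $M_p$ by 

\[ M_p = \begin{bmatrix} pI_{n-2} & 0 \\ 0 & \begin{matrix} d & e \\ -e & d \end{matrix} \end{bmatrix}. \tag{4.18} \]

It is immediate that $M_pM_p^T = p^2 I_n$. Using (4.13) and (4.17) for $k \ne j$, we have 

\[ \begin{aligned}
M_pA_1 &= \begin{bmatrix} pI_{n-2} & 0 \\ 0 & \begin{matrix} d & e \\ -e & d \end{matrix} \end{bmatrix}
\begin{bmatrix} ps_1X \\ \begin{matrix} x_1 & \cdots & x_j & \cdots & x_n \\ y_1 & \cdots & y_j & \cdots & y_n \end{matrix} \end{bmatrix} \\
&= \begin{bmatrix} p^2 s_1X \\ \begin{matrix} dx_1 + ey_1 & \cdots & dx_j + ey_j & \cdots & dx_k + ey_k & \cdots & dx_n + ey_n \\ -ex_1 + dy_1 & \cdots & -ex_j + dy_j & \cdots & -ex_k + dy_k & \cdots & -ex_n + dy_n \end{matrix} \end{bmatrix} \\
&= \begin{bmatrix} p^2 s_1X \\ \begin{matrix} p^2 w_1 & \cdots & p^2 w_j & \cdots & p^2 w_k & \cdots & p^2 w_n \\ p^2 z_1 & \cdots & p^2 z_j & \cdots & p^2 z_k & \cdots & p^2 z_n \end{matrix} \end{bmatrix} = p^2 A_2.
\end{aligned} \tag{4.19} \]

Here $A_2$ is an integral matrix with the properties of $A_1$, but with $s = ps_1$ 
replaced by the smaller $s_1$. Hence the smallest s appearing in such an A must 
be s = 1 and in this case A is an integral completion of X, proving our 
theorem. 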
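
One descent step of this proof can be followed on a small example. The sketch below is illustrative only: the matrices are hypothetical, with n = 3, m = 1, X = (1, 0, 0) and the rational completion whose last two rows are (0, 3/5, 4/5) and (0, -4/5, 3/5), so that s = p = 5 and s_1 = 1. Since 3 - 4i = (2 - i)^2, the notation of the proof gives d + ei = 3 - 4i.

```python
def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def transpose(A):
    return [list(col) for col in zip(*A)]

p, s1, m = 5, 1, 1
d, e = 3, -4                       # d + ei = (2 - i)^2, so d^2 + e^2 = p^2

A1 = [[5, 0, 0],                   # s * X with s = p * s1
      [0, 3, 4],                   # the row of x's
      [0, -4, 3]]                  # the row of y's

Mp = [[p, 0, 0],                   # the matrix of (4.18) for n = 3
      [0, d, e],
      [0, -e, d]]

product = matmul(Mp, A1)           # every entry is divisible by p^2
A2 = [[v // (p * p) for v in row] for row in product]
```

Here A2 is the identity matrix, an integral completion of X with s_1 = 1 in place of s = 5.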

Theorem 4.2 If m and n are integers satisfying the conditions of Theorem 2.1 
and if X is an r by n integral matrix satisfying $XX^T = mI_r$, then if $n \le 4$ there 
is always an integral completion A of order n with X as its first r rows such that 
$AA^T = mI_n$. 

Proof This is essentially a corollary of Theorem 4.1 which covers all 
cases with $n \le 4$ except for r = 1, n = 4. But in this case taking X as the first 
row of a matrix B as in (2.8) we have an integral completion of X. 

5. Integral Completions for Hadamard Matrices 

A Hadamard matrix $H = [h_{ij}]$, i, j = 1, ..., n, is a square matrix with 
every entry $h_{ij} = \pm 1$ such that 

\[ HH^T = nI_n, \qquad H^TH = nI_n. \tag{5.1} \]

These two conditions are equivalent as both of them assert that $nH^{-1} = H^T$. 
It is well known that for a Hadamard matrix n = 1, 2 or $n \equiv 0 \pmod 4$. The 
natural question here is the following: given an r by n matrix X with every 
entry $\pm 1$ such that 

\[ XX^T = nI_r \tag{5.2} \]

does there exist a Hadamard matrix H with X as its first r rows? 
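
For small n the question can be explored by exhaustive search. The following sketch is not the method of this paper; the function names `is_hadamard` and `complete` are hypothetical, and the search is exponential in n, so it is only practical for very small orders.

```python
from itertools import product

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def is_hadamard(H):
    """Check the conditions of (5.1) for a list of rows with entries +1, -1."""
    n = len(H)
    return (all(len(r) == n and all(v in (1, -1) for v in r) for r in H)
            and all(dot(H[i], H[k]) == (n if i == k else 0)
                    for i in range(n) for k in range(n)))

def complete(X, n):
    """Backtracking search for a Hadamard matrix whose first rows are X."""
    rows = [list(r) for r in X]
    # Only sign vectors orthogonal to every given row can be used.
    candidates = [list(v) for v in product((1, -1), repeat=n)
                  if all(dot(v, r) == 0 for r in rows)]

    def extend(rows, cands):
        if len(rows) == n:
            return rows
        for idx, v in enumerate(cands):
            # Keep later candidates orthogonal to v; row order is immaterial.
            rest = [w for w in cands[idx + 1:] if dot(v, w) == 0]
            if len(rest) >= n - len(rows) - 1:
                found = extend(rows + [v], rest)
                if found:
                    return found
        return None

    return extend(rows, candidates)

H = complete([[1, 1, 1, 1]], 4)    # extend a single row to order 4
```

The function returns `None` when no Hadamard completion exists, which makes it convenient for testing particular matrices X of small order.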

Leaving the cases n = 1, 2 as exercises for the reader, we shall assume 
$n \equiv 0 \pmod 4$ so that the conditions of Theorem 2.1 are automatically 
satisfied. 

Theorem 5.1 Let X be an r by n matrix, $n \equiv 0 \pmod 4$, with every entry 
+1 or -1 satisfying (5.2). Then if r = n - 1, n - 2, n - 3, or n - 4 there is a 
Hadamard matrix of order n with X as its first r rows. 

Proof Let $H_1$ be a rational completion of X. Then $H_1H_1^T = 
H_1^TH_1 = nI_n$ so that if $H_1 = [h_{ij}]$ we have 

\[ \sum_{t=1}^{n} h_{tj}^2 = n, \qquad \sum_{t=1}^{n} h_{tj}h_{tk} = 0, \qquad j, k = 1, \ldots, n, \ j \ne k. \tag{5.3} \]

It follows that 

\[ \sum_{t=r+1}^{n} h_{tj}^2 = n - \sum_{t=1}^{r} h_{tj}^2 = n - r, \qquad j = 1, \ldots, n \tag{5.4} \]

and 

\[ \sum_{t=r+1}^{n} h_{tj}h_{tk} = -\sum_{t=1}^{r} h_{tj}h_{tk} = \sum_{t=1}^{r} \pm 1 \equiv r \pmod 2. \tag{5.5} \]


In finding a completion for X it is clear that we may permute the rows of 
X or change signs of some of them without altering the final rows of the 
completion. Also if we permute the columns of X and change the signs of 
some of them, if we perform the same operations on the columns of the final 
rows, we shall still have a completion. Since these operations take integral 
completions into integral completions and are reversible, we may use them 
freely in deciding whether or not a specified X has a completion. 

If Y and Z are two rational q by n - r matrices with $YY^T = ZZ^T$ a 
nonsingular matrix, then by Theorem 3.2 there exists a rational orthogonal matrix U of 
order n - r such that YU = Z. Taking transposes we have $U^TY^T = Z^T$. 
Hence we may define $H_1^*$ by 

\[ H_1^* = \begin{bmatrix} I_r & 0 \\ 0 & U^T \end{bmatrix} \begin{bmatrix} X \\ W \end{bmatrix} = \begin{bmatrix} X \\ U^TW \end{bmatrix}. \tag{5.6} \]

Then $H_1^*$ is also a rational completion of X and if the first q columns of W 
are the matrix $Y^T$, then the first q columns of $U^TW$ are the matrix $Z^T$. This is 
very useful since in the cases we shall consider we can replace rational 
columns $Y^T$ by integral columns $Z^T$ and obtain an integral completion. 

First, if r = n - 1, from (5.4) in the rational completion $H_1$ we have 
$h_{nj}^2 = 1$, j = 1, ..., n, and so $h_{nj} = \pm 1$, j = 1, ..., n, and thus $H_1 = H$ is the 
completion which is a Hadamard matrix. 

If r = n - 2 write $H_1$ in the form 

\[ H_1 = \begin{bmatrix} X \\ \begin{matrix} x_1 & x_2 & \cdots & x_n \\ y_1 & y_2 & \cdots & y_n \end{matrix} \end{bmatrix}. \tag{5.7} \]

Here (5.4) and (5.5) give 

\[ x_j^2 + y_j^2 = 2, \quad j = 1, \ldots, n, \qquad x_jx_k + y_jy_k = 2s, \quad j \ne k \tag{5.8} \]

where 2s is some even integer and since $(x_j \pm x_k)^2 + (y_j \pm y_k)^2 \ge 0$ we have 
$-2 \le x_jx_k + y_jy_k \le 2$. 

By Theorem 3.2, as $x_1^2 + y_1^2 = 2$ we can apply (5.6) to replace $H_1$ by a 
completion $H_1^*$ in which $(x_1, y_1) = (1, 1)$, and without loss of generality 
assume this to be the case in $H_1$. Thus from (5.8) with $j \ne 1$ and k = 1 

\[ x_j^2 + y_j^2 = 2, \qquad x_j + y_j = -2, 0, 2. \tag{5.9} \]
If $x_j + y_j = 2$, then 

\[ (x_j - 1)^2 + (y_j - 1)^2 = 2 - 4 + 2 = 0 \tag{5.10} \]

and $x_j = 1$, $y_j = 1$. Similarly if $x_j + y_j = -2$, then 

\[ (x_j + 1)^2 + (y_j + 1)^2 = 2 - 4 + 2 = 0 \tag{5.11} \]

and $x_j = -1$, $y_j = -1$. 

If $x_j + y_j = \pm 2$ for all j, then from (5.10) and (5.11) the last two rows are 
identical. This is impossible since the inner product of the last two rows is 
zero. Hence for some j we have $x_j + y_j = 0$ and then 

\[ x_j^2 + y_j^2 = 2, \quad x_j + y_j = 0, \quad 2x_j^2 = 2, \quad x_j = \pm 1, \quad y_j = \mp 1. \]

Thus without loss of generality, we may take $x_2 = 1$, $y_2 = -1$. Now $H_1$ has 
the form 

\[ H_1 = \begin{bmatrix} X \\ \begin{matrix} 1 & 1 & \cdots & x_j & \cdots & x_n \\ 1 & -1 & \cdots & y_j & \cdots & y_n \end{matrix} \end{bmatrix}. \tag{5.12} \]

Here for j > 2 we have from (5.9) with k = 1 and k = 2 

\[ x_j^2 + y_j^2 = 2, \qquad x_j + y_j = 0, \pm 2, \qquad x_j - y_j = 0, \pm 2. \tag{5.13} \]

Here the only possibilities are $(x_j, y_j) = (1, 1), (-1, -1), (1, -1)$, and 
$(-1, 1)$. Hence $H = H_1$ has every entry $\pm 1$ and so is a Hadamard matrix. 
We might also have appealed to Theorem 4.1 and taken $H_1$ as an integral 
completion of X and then $x_j^2 + y_j^2 = 2$ would give $x_j = \pm 1$, $y_j = \pm 1$, and so 
$H_1$ is a Hadamard matrix. 

Now take r = n - 3 and write the rational completion $H_1$ in the form 

\[ H_1 = \begin{bmatrix} X \\ \begin{matrix} x_1 & \cdots & x_j & \cdots & x_n \\ y_1 & \cdots & y_j & \cdots & y_n \\ z_1 & \cdots & z_j & \cdots & z_n \end{matrix} \end{bmatrix}. \]

Here (5.4) and (5.5) give 

\[ x_j^2 + y_j^2 + z_j^2 = 3, \qquad x_jx_k + y_jy_k + z_jz_k = -3, -1, 1, 3, \qquad j, k = 1, \ldots, n, \ j \ne k \tag{5.14} \]

since $x_jx_k + y_jy_k + z_jz_k$ is an odd integer in absolute value at most 3. 
Since $x_1^2 + y_1^2 + z_1^2 = 3$, by Theorem 3.2 we may apply (5.6) to replace 
$(x_1, y_1, z_1)$ by (1, 1, 1), and assume this to be the case in $H_1$. Then from 
(5.14) with $j \ne 1$ and k = 1 we have 

\[ x_j^2 + y_j^2 + z_j^2 = 3, \qquad x_j + y_j + z_j = -3, -1, 1, 3. \tag{5.15} \]

If $x_j + y_j + z_j = 3$, then $(x_j - 1)^2 + (y_j - 1)^2 + (z_j - 1)^2 = 0$ and 
$(x_j, y_j, z_j) = (1, 1, 1)$, and similarly if $x_j + y_j + z_j = -3$, then $(x_j, y_j, z_j) = 
(-1, -1, -1)$. Hence unless $x_j + y_j + z_j = \pm 1$ for some j, the last three 
rows are identical, which conflicts with their inner products being zero. 
Without loss of generality take j = 2 and $x_2 + y_2 + z_2 = 1$. Then defining Y 
and Z by 

\[ Y = \begin{bmatrix} 1 & 1 & 1 \\ x_2 & y_2 & z_2 \end{bmatrix}, \qquad Z = \begin{bmatrix} 1 & 1 & 1 \\ -1 & 1 & 1 \end{bmatrix} \tag{5.16} \]

we have 

\[ YY^T = ZZ^T = \begin{bmatrix} 3 & 1 \\ 1 & 3 \end{bmatrix}. \tag{5.17} \]

Hence from Theorem 3.2 there is a rational orthogonal matrix U with 
YU = Z. Hence applying (5.6) we have a rational completion $H_1^*$ with 
$(x_1, y_1, z_1) = (1, 1, 1)$ and $(x_2, y_2, z_2) = (-1, 1, 1)$ which we may assume 
to hold in $H_1$. 

Then with j > 2 and k = 1 and 2, (5.14) becomes 

\[ x_j^2 + y_j^2 + z_j^2 = 3, \qquad x_j + y_j + z_j = \pm 1, \pm 3, \qquad -x_j + y_j + z_j = \pm 1, \pm 3. \tag{5.18} \]

We note that if the inner product of two columns of the last three rows is 
+3, then the columns are identical, while if the inner product is -3 then 
they are negatives of each other. If this held for every j > 2, then the last two 
rows would be identical, a conflict. Thus for certain j's 

\[ x_j + y_j + z_j = \varepsilon_1 = \pm 1, \qquad -x_j + y_j + z_j = \varepsilon_2 = \pm 1. \tag{5.19} \]

If $\varepsilon_1 = \varepsilon_2$, then $x_j = 0$; while if $\varepsilon_1 = -\varepsilon_2$, then $x_j = \varepsilon_1$. Hence every $x_j$, 
j = 1, ..., n is 0 or $\pm 1$. But since $x_1^2 + \cdots + x_n^2 = n$, the alternative $x_j = 0$ 
cannot arise and $x_j = \pm 1$, j = 1, ..., n. We can now add the row $x_1, \ldots, x_n$ to 
X to get X' and by our previous proof complete X' with two more rows to a 
Hadamard H. But we may also proceed directly taking j = 3, $\varepsilon_1 = 1$ without 
loss of generality. Then with 



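\[ Y = \begin{bmatrix} 1 & 1 & 1 \\ -1 & 1 & 1 \\ x_3 & y_3 & z_3 \end{bmatrix} \quad \text{and} \quad Z = \begin{bmatrix} 1 & 1 & 1 \\ -1 & 1 & 1 \\ 1 & 1 & -1 \end{bmatrix} \tag{5.20} \]

we have 

\[ YY^T = ZZ^T = \begin{bmatrix} 3 & 1 & 1 \\ 1 & 3 & -1 \\ 1 & -1 & 3 \end{bmatrix}, \tag{5.21} \]

so that by Theorem 3.2 and (5.6) we may take 

\[ H_1 = \begin{bmatrix} X \\ \begin{matrix} 1 & -1 & 1 & \cdots & x_j & \cdots & x_n \\ 1 & 1 & 1 & \cdots & y_j & \cdots & y_n \\ 1 & 1 & -1 & \cdots & z_j & \cdots & z_n \end{matrix} \end{bmatrix} \tag{5.22} \]
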


and we assert that $H_1$ is a Hadamard matrix. For we have for j > 3, 

\[ \begin{gathered}
x_j^2 + y_j^2 + z_j^2 = 3, \\
x_j + y_j + z_j = \pm 1, \pm 3, \qquad -x_j + y_j + z_j = \pm 1, \pm 3, \qquad x_j + y_j - z_j = \pm 1, \pm 3.
\end{gathered} \tag{5.23} \]

If we have $\pm 3$ in any one of these cases, the jth column is equal to one of the 
first three or its negative. There remains 

\[ x_j + y_j + z_j = \varepsilon_1 = \pm 1, \qquad -x_j + y_j + z_j = \varepsilon_2 = \pm 1, \qquad x_j + y_j - z_j = \varepsilon_3 = \pm 1. \tag{5.24} \]

As observed earlier we must have $\varepsilon_1 = -\varepsilon_2$ and $x_j = \varepsilon_1$. Similarly, if $\varepsilon_1 = \varepsilon_3$, 
we have $z_j = 0$; while if $\varepsilon_1 = -\varepsilon_3$, we have $z_j = \varepsilon_1$. And since we must 
exclude $z_j = 0$, we have $\varepsilon_2 = -\varepsilon_1$, $\varepsilon_3 = -\varepsilon_1$, and 

\[ (x_j, y_j, z_j) = (1, -1, 1) \ \text{or} \ (-1, 1, -1). \tag{5.25} \]

Hence every x, y, and z is $\pm 1$ and the completion of (5.22) is a Hadamard 
matrix. 

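The case with r = n - 4 is more difficult to handle. We write a rational 
completion $H_1$ as 

\[ H_1 = \begin{bmatrix} X \\ \begin{matrix} x_1 & \cdots & x_j & \cdots & x_n \\ y_1 & \cdots & y_j & \cdots & y_n \\ z_1 & \cdots & z_j & \cdots & z_n \\ w_1 & \cdots & w_j & \cdots & w_n \end{matrix} \end{bmatrix}. \tag{5.26} \]

From (5.4) and (5.5) we have 

\[ x_j^2 + y_j^2 + z_j^2 + w_j^2 = 4, \qquad x_jx_k + y_jy_k + z_jz_k + w_jw_k = 0, \pm 2, \pm 4. \tag{5.27} \]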





132 


MARSHALL HALL, JR. 


We subdivide the proof into two subcases. In the first subcase we assume 
that only the inner products 0, $\pm 4$ arise. We may apply Theorem 3.2 and 
(5.6) to replace $(x_1, y_1, z_1, w_1)$ by (1, 1, 1, 1). If the jth column has inner 
product $\pm 4$ with the first column, it will be the same as the first column or 
its negative. Since the last four rows are not identical, there will be a j for 
which $x_j + y_j + z_j + w_j = 0$, which we may take to be j = 2. Then from 
Theorem 3.2 and (5.6) we may replace the first two columns by (1, 1, 1, 1) 
and (1, 1, -1, -1). Since the last two rows cannot be identical, there must 
be a column whose inner product is not $\pm 4$ with either of these, and by our 
assumption this means a column whose inner product is zero with both of 
these. Without loss we may take this to be the third column and by Theorem 
3.2 and (5.6) replace $(x_3, y_3, z_3, w_3)$ by (1, -1, 1, -1). Now $H_1$ is in the 
form 

\[ H_1 = \begin{bmatrix} X \\ \begin{matrix} 1 & 1 & 1 & \cdots & x_j & \cdots & x_n \\ 1 & 1 & -1 & \cdots & y_j & \cdots & y_n \\ 1 & -1 & 1 & \cdots & z_j & \cdots & z_n \\ 1 & -1 & -1 & \cdots & w_j & \cdots & w_n \end{matrix} \end{bmatrix}. \tag{5.28} \]

Now we assert that $H_1$ is a Hadamard matrix. For $x_j^2 + y_j^2 + z_j^2 + w_j^2 = 4$ 
and if this column has inner product $\pm 4$ with one of the first three, it is equal 
to that column or its negative. But if it has inner product zero with all three 
of the first, then we find $(x_j, y_j, z_j, w_j) = (1, -1, -1, 1)$ or $(-1, 1, 1, -1)$ 
since $x_j = -y_j = -z_j = w_j$ from the vanishing of the inner products. Hence 
in the first subcase there is a completion that is a Hadamard matrix. 

In the second subcase there will be an inner product $\pm 2$. Without loss we 
may take this to be +2 and the inner product of the first two columns. 
Hence by Theorem 3.2 and (5.6) we may take $H_1$ in the form 

\[ H_1 = \begin{bmatrix} X \\ \begin{matrix} 1 & -1 & \cdots & x_j & \cdots & x_n \\ 1 & 1 & \cdots & y_j & \cdots & y_n \\ 1 & 1 & \cdots & z_j & \cdots & z_n \\ 1 & 1 & \cdots & w_j & \cdots & w_n \end{matrix} \end{bmatrix}. \tag{5.29} \]

Now let us change the signs of rows of X to make the first column consist 
entirely of +1's. Since the inner product of the first two columns of $H_1$ is 
zero, writing n = 4t we find in X 

\[ 2t - 3 \ \text{rows} \ 1, 1, \ldots, \qquad 2t - 1 \ \text{rows} \ 1, -1, \ldots. \tag{5.30} \]

In X we shall subdivide these rows according to whether the entry in 
column j is +1 or -1: 

\[ \begin{array}{lrrr}
\text{column} & 1 & 2 & j \\
a \ \text{rows} & 1 & 1 & 1 \\
b \ \text{rows} & 1 & 1 & -1 \\
c \ \text{rows} & 1 & -1 & 1 \\
d \ \text{rows} & 1 & -1 & -1
\end{array} \tag{5.31} \]

From (5.30) we have 

\[ a + b = 2t - 3, \qquad c + d = 2t - 1. \tag{5.32} \]

We now calculate the inner product of the jth column $c_j$ with the first and 
second columns $c_1$ and $c_2$: 

\[ \begin{aligned}
(c_1, c_j) &= a - b + c - d + x_j + y_j + z_j + w_j = 0, \\
(c_2, c_j) &= a - b - c + d - x_j + y_j + z_j + w_j = 0.
\end{aligned} \tag{5.33} \]

Subtracting, it now follows that 

\[ x_j = -c + d = 2t - 1 - 2c. \tag{5.34} \]

Hence $x_j$ is an odd integer and as $x_j^2 + y_j^2 + z_j^2 + w_j^2 = 4$ necessarily 
$x_j = \pm 1$. We may now adjoin the row of x's to X to form X', an n - 3 by n 
matrix of $\pm 1$'s, and by our previous case there is a Hadamard matrix H that 
is a completion of X'. This completes the proof of the second subcase for 
r = n - 4 and so all parts of the theorem. 

It is not true that every X with entries $\pm 1$ satisfying (5.2) can be 
completed to a Hadamard matrix. We shall show what some of these X's 
are. There is, up to equivalence of permuting rows and columns, and changing 
signs of rows and columns, a unique Hadamard matrix $H_4$ of order 4: 

\[ H_4 = \begin{bmatrix} 1 & 1 & 1 & 1 \\ 1 & 1 & -1 & -1 \\ 1 & -1 & 1 & -1 \\ 1 & -1 & -1 & 1 \end{bmatrix}. \tag{5.35} \]

If we take the tensor product of $e_t = (1, \ldots, 1)$ with t 1's, t being odd, with 
$H_4$ we have an X, 

\[ X = \begin{bmatrix}
1 \cdots 1 & 1 \cdots 1 & 1 \cdots 1 & 1 \cdots 1 \\
1 \cdots 1 & 1 \cdots 1 & -1 \cdots -1 & -1 \cdots -1 \\
1 \cdots 1 & -1 \cdots -1 & 1 \cdots 1 & -1 \cdots -1 \\
1 \cdots 1 & -1 \cdots -1 & -1 \cdots -1 & 1 \cdots 1
\end{bmatrix} \tag{5.36} \]

where X is a 4 by 4t matrix such that $XX^T = 4tI_4$. 

Since $H_4$ in (5.35) is normalized so that the first column consists entirely of 
+1's and since the columns are orthogonal, the sum of the entries in every 
other column is zero. Hence the sum of the four rows of $H_4$ is the vector 
(4, 0, 0, 0). Correspondingly the sum of the four rows of X is $(4e_t, 0, 0, 0)$. 
Consequently, a vector $(x_1, \ldots, x_t, \ldots, x_{4t})$ in which every $x_i = \pm 1$ cannot 
be orthogonal to all four rows of X, since then it would be orthogonal to their 
sum $(4e_t, 0, 0, 0)$ and then in $(x_1, \ldots, x_t)$ half the entries would have to be 
+1 and the other half -1. But with t odd, this is impossible. In the same way 
the tensor product of $e_t$ with any Hadamard matrix $H_{4m}$ of order 4m will give 
an X that is 4m by 4mt which cannot be completed to a Hadamard matrix 
$H_{4mt}$, since the existence of a completion of X is independent of the 
normalization of $H_{4m}$. 
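
The obstruction can be confirmed by exhaustive search in the smallest case t = 3. The sketch below is illustrative; the helper names `blow_up` and `orthogonal_sign_vectors` are hypothetical. It builds the matrix X of (5.36) and lists every vector of +1's and -1's orthogonal to all four rows.

```python
from itertools import product

H4 = [[1, 1, 1, 1],
      [1, 1, -1, -1],
      [1, -1, 1, -1],
      [1, -1, -1, 1]]

def blow_up(H, t):
    """Replace each entry h of H by the block (h, ..., h) of length t."""
    return [[h for h in row for _ in range(t)] for row in H]

def orthogonal_sign_vectors(X):
    """All vectors with entries +1, -1 orthogonal to every row of X."""
    n = len(X[0])
    return [v for v in product((1, -1), repeat=n)
            if all(sum(a * b for a, b in zip(v, row)) == 0 for row in X)]

X3 = blow_up(H4, 3)                      # t odd: 4 by 12
X2 = blow_up(H4, 2)                      # t even: 4 by 8
none_for_odd_t = orthogonal_sign_vectors(X3) == []
some_for_even_t = len(orthogonal_sign_vectors(X2)) > 0
```

For t = 3 the list is empty, so X cannot even be extended by one further row of signs, while for t = 2 such vectors do exist.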

In the light of these observations the matrix A (5.37) is of interest. 


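\[ A = \left[\begin{array}{rrr|rrr|rrr|rrr}
1 & 1 & 1 & 1 & 1 & 1 & 1 & 1 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & 1 & 1 & -1 & -1 & -1 & -1 & -1 & -1 \\
1 & 1 & 1 & -1 & -1 & -1 & 1 & 1 & 1 & -1 & -1 & -1 \\
1 & 1 & 1 & -1 & -1 & -1 & -1 & -1 & -1 & 1 & 1 & 1 \\
2 & -2 & 0 & 1 & -1 & 0 & 1 & -1 & 0 & 0 & 0 & 0 \\
2 & 0 & -2 & -1 & 1 & 0 & -1 & 1 & 0 & 0 & 0 & 0 \\
0 & 2 & -2 & 1 & -1 & 0 & 1 & -1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & -1 & -1 & 0 & 1 & 2 & -2 & 0 \\
0 & 0 & 0 & -1 & 0 & 1 & 1 & 0 & -1 & 2 & 0 & -2 \\
0 & 0 & 0 & 1 & 0 & -1 & -1 & 0 & 1 & 0 & 2 & -2 \\
0 & 0 & 0 & -1 & 2 & -1 & 1 & -2 & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & -1 & -1 & 2 & -1 & -1 & 2 & 0 & 0 & 0
\end{array}\right] \tag{5.37} \]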


Here $AA^T = A^TA = 12I$ so that although the first four rows X of A are 
such that they cannot be extended to a Hadamard matrix, nevertheless A is 
an integral completion of X. 
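
The orthogonality relations for A are easy to confirm by machine. The sketch below assumes the entries of (5.37) read column block by column block from the printed array (four blocks of three columns); the variable names are illustrative.

```python
# Rows of the matrix A of (5.37), as transcribed here.
A = [
    [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 1, -1, -1, -1, -1, -1, -1],
    [1, 1, 1, -1, -1, -1, 1, 1, 1, -1, -1, -1],
    [1, 1, 1, -1, -1, -1, -1, -1, -1, 1, 1, 1],
    [2, -2, 0, 1, -1, 0, 1, -1, 0, 0, 0, 0],
    [2, 0, -2, -1, 1, 0, -1, 1, 0, 0, 0, 0],
    [0, 2, -2, 1, -1, 0, 1, -1, 0, 0, 0, 0],
    [0, 0, 0, 1, 0, -1, -1, 0, 1, 2, -2, 0],
    [0, 0, 0, -1, 0, 1, 1, 0, -1, 2, 0, -2],
    [0, 0, 0, 1, 0, -1, -1, 0, 1, 0, 2, -2],
    [0, 0, 0, -1, 2, -1, 1, -2, 1, 0, 0, 0],
    [0, 0, 0, -1, -1, 2, -1, -1, 2, 0, 0, 0],
]

def gram(M):
    """The matrix M M^T as a list of rows."""
    return [[sum(a * b for a, b in zip(r, s)) for s in M] for r in M]

twelve_I = [[12 if i == k else 0 for k in range(12)] for i in range(12)]
rows_ok = gram(A) == twelve_I
cols_ok = gram([list(c) for c in zip(*A)]) == twelve_I
```

Both checks succeed for this transcription, and the first four rows coincide with the matrix X of (5.36) for t = 3.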
Lie Algebraic Proofs 

of Some Theorems on Partitions 

J. W. B. HUGHES 

QUEEN MARY COLLEGE 
LONDON, ENGLAND 


An analysis is given of the layer structure of fundamental representations of the simple Lie algebras, 
expressions being given for the multiplicities of the layers in terms of the partitions (j; l, i), 
P(j; n, n), and Q(j'; a, b, c). A general theorem due to E. B. Dynkin concerning the layer structure 
is then used to derive certain properties of these partitions; a typical example is the result 
$P(j; n, n) \ge P(j - 1; n, n)$ for $j \le [\tfrac14 n(n + 1)]$. 


1. Introduction 

In a previous paper [1] a proof was given of a theorem on the partition 
function (j; n - i, i) of an integer j into at most n - i integers none of which 
exceeds i, namely that, for $0 < j \le [i(n - i)/2]$, $(j; n - i, i) \ge (j - 1; n - i, i)$; 
the connection of this theorem to the layer structure of fundamental 
representations of the Lie algebra A(n - 1) was also mentioned. The proof was 
unfortunately quite erroneous,‡ and in fact properties of this nature appear 
to be quite difficult to prove using conventional means (see for instance 
Chowla [2, unsolved problem 68] and Szekeres [3, 4]). However, by a reversal 
of the argument, known properties of the structure of these representations 
could be used to give a Lie algebraic proof of the above theorem. The 
theorem was in fact already proved by Elliott [5] using invariant theory of 
quantics, but by considering the layer structure of representations of the 
other simple Lie algebras, new theorems concerning different partitions can 
also be derived. 

‡ I am indebted to Professor G. E. Andrews of Pennsylvania State University for pointing 
this out to me. 

It is the purpose of this paper, therefore, to give the analysis of the 
fundamental representations of A(n - 1), B(n), C(n), and D(n), and in particular 
to calculate the multiplicities of the jth layers of these representations. 
In all cases they are found to equal certain partition functions, the above 
(j; n - i, i) for A(n - 1), the partition function P(j; n, n) of j into at most n 
strictly decreasing integers none of which exceeds n in the case of the nth 
fundamental representation of B(n) and the (n - 1)th and nth fundamental 
representation of D(n). For the cases of the other fundamental representations 
of B(n), C(n), and D(n), these multiplicities are given in terms of the 
partition function Q(j'; a, b, c) which denotes the number of partitions of j' 
into exactly a strictly increasing integers none of which exceeds b and in 
which c of the integers contribute negatively. 

Using a theorem by Dynkin [6], which states that these layers are 
“spindle shaped,” five theorems, each in two parts, are derived. The first part 
of each theorem is trivial and can be derived very simply, in the first two 
cases from elementary properties of the appropriate generating functions, 
and in the other cases from the obvious relation 

\[ Q(j'; a, b, c) = Q(-j'; a, b, a - c). \tag{1} \]

The second parts of the theorems, on the other hand, do not appear to be 
easily derivable using conventional methods and are, as far as the author is 
aware, with the exception of Theorem 1b, all quite new. Theorem 1 concerns 
(j; n - i, i); Theorem 2 concerns P(j; n, n); and Theorems 3-5 concern 
Q(j'; a, b, c). 

Before giving these Lie algebraic derivations, in order to place the results 
of this paper in perspective we show how some elementary properties of the 
above partitions can be derived using their generating functions. For 
(j; n - i, i) the generating function is the Gaussian polynomial 

\[ \begin{bmatrix} n \\ i \end{bmatrix}(q) = \frac{(1 - q^n) \cdots (1 - q^{n-i+1})}{(1 - q) \cdots (1 - q^i)} = \sum_{j=0}^{i(n-i)} (j; n - i, i)\, q^j \tag{2} \]

(see Hardy and Wright [7], Rademacher [8]) where one conventionally 
defines $\begin{bmatrix} n \\ 0 \end{bmatrix}(q) = 1$. The Gaussian polynomials are reciprocal and satisfy [8] 

\[ \begin{bmatrix} n \\ i \end{bmatrix}(1) = \binom{n}{i}. \tag{3} \]

The reciprocal nature of $\begin{bmatrix} n \\ i \end{bmatrix}(q)$ will turn out to be precisely the content of 
Theorem 1a. Also, the use of (3) in (2) gives 

\[ \sum_{j=0}^{i(n-i)} (j; n - i, i) = \binom{n}{i}, \tag{4} \]

which, as we shall see in Section 2, states that the ith fundamental 
representation of A(n - 1) has dimension $\binom{n}{i}$. 
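
The statements about (j; n - i, i) are easy to test numerically. The following sketch counts the partitions directly from the definition rather than expanding the quotient in (2); the function names are hypothetical.

```python
from functools import lru_cache
from math import comb

@lru_cache(maxsize=None)
def count(j, parts, largest):
    """Partitions of j into at most `parts` parts, none exceeding `largest`."""
    if j == 0:
        return 1
    if parts == 0:
        return 0
    # Choose the largest part v, then partition the rest with parts <= v.
    return sum(count(j - v, parts - 1, v) for v in range(1, min(largest, j) + 1))

def gaussian_coefficients(n, i):
    """The numbers (j; n - i, i) for j = 0, ..., i(n - i)."""
    return [count(j, n - i, i) for j in range(i * (n - i) + 1)]

c = gaussian_coefficients(6, 3)
reciprocal = c == c[::-1]                      # symmetry of the coefficients
total_is_binomial = sum(c) == comb(6, 3)       # relation (4)
rising = all(c[j] >= c[j - 1] for j in range(1, len(c) // 2 + 1))
```

For n = 6, i = 3 the coefficients are 1, 1, 2, 3, 3, 3, 3, 2, 1, 1; they are symmetric, sum to 20, and do not decrease up to the middle, in line with the inequality quoted in the Introduction.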

The generating function for P(j; n, n) is 

\[ \sum_{r=0}^{n} q^{r(r+1)/2} \begin{bmatrix} n \\ r \end{bmatrix}(q) = (1 + q)(1 + q^2) \cdots (1 + q^n) = \sum_{j=0}^{n(n+1)/2} P(j; n, n)\, q^j. \tag{5} \]

Theorem 2a will turn out to be the statement that this, too, is a reciprocal 
polynomial. Also, putting q = 1 in (5) gives 

\[ \sum_{j=0}^{n(n+1)/2} P(j; n, n) = \sum_{r=0}^{n} \binom{n}{r} = 2^n, \tag{6} \]

which states (see Section 3) that the nth fundamental representation of B(n) 
has dimension $2^n$. 
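
The product form of the generating function for P(j; n, n) lends itself to a quick numerical check. The sketch below multiplies out (1 + q)(1 + q^2)...(1 + q^n) with plain coefficient lists; the function names are illustrative.

```python
def poly_mul(a, b):
    """Product of two polynomials given as coefficient lists."""
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for k, y in enumerate(b):
            out[i + k] += x * y
    return out

def distinct_part_counts(n):
    """P(j; n, n) for j = 0, ..., n(n+1)/2 from the product (1+q)...(1+q^n)."""
    poly = [1]
    for r in range(1, n + 1):
        poly = poly_mul(poly, [1] + [0] * (r - 1) + [1])
    return poly

P = distinct_part_counts(5)
reciprocal = P == P[::-1]
total = sum(P)                                  # should equal 2^5
rising = all(P[j] >= P[j - 1] for j in range(1, 5 * 6 // 4 + 1))
```

For n = 5 the coefficients are symmetric, sum to 32, and do not decrease for j up to [n(n + 1)/4] = 7, which is the kind of property the Lie algebraic argument is designed to prove in general.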

Finally, for Q(j'; a, b, c) we have the generating function‡ 

\[ \prod_{r=1}^{b} (1 + xq^r + xtq^{-r}) = \sum_{a, c \ge 0} \ \sum_{j'=-\infty}^{\infty} Q(j'; a, b, c)\, x^a t^c q^{j'}. \tag{7} \]

It is worth noting here that while Q(j'; a, b, c) arises naturally in the context 
of this paper, its definition as a partition function is somewhat artificial. 
However, let Q*(j'; a, b, c) denote the number of partitions of j' with each 
part no larger than b, with no part appearing more than twice, with a parts 
appearing exactly once, and with c integers used as parts. Then 

\[ Q\Bigl(j' - \tbinom{b+1}{2}; b - a, b, b - c\Bigr) = Q^*(j'; a, b, c). \tag{8} \]

This is obvious from the following elementary identities: 

\[ \begin{aligned}
\sum Q\Bigl(j' - \tbinom{b+1}{2}; b - a, b, b - c\Bigr) x^a t^c q^{j'}
&= q^{\binom{b+1}{2}} x^b t^b \sum Q(j'; a, b, c)\, x^{-a} t^{-c} q^{j'} \\
&= q^{\binom{b+1}{2}} x^b t^b \prod_{r=1}^{b} (1 + x^{-1}q^r + x^{-1}t^{-1}q^{-r}) \\
&= \prod_{r=1}^{b} xtq^r (1 + x^{-1}q^r + x^{-1}t^{-1}q^{-r}) \\
&= \prod_{r=1}^{b} (1 + xtq^r + tq^{2r}) = \sum Q^*(j'; a, b, c)\, x^a t^c q^{j'}.
\end{aligned} \tag{9} \]

‡ I should like to thank the referee for pointing this out to me. 

Thus Q is only a superficially altered Q*, and Q* is a very natural partition 
function. 
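
The relation between Q and Q* can be verified by enumeration for small b. The sketch below is illustrative only: it assumes the relation in the form Q(j' - b(b + 1)/2; b - a, b, b - c) = Q*(j'; a, b, c), and the function names are hypothetical. Each integer r from 1 to b is given one of three states, exactly as in the two products.

```python
from collections import Counter
from itertools import product

def q_counts(b):
    """Counter keyed by (j, a, c): a distinct parts <= b, c of them negative."""
    counts = Counter()
    for state in product((0, 1, 2), repeat=b):   # 0 unused, 1 positive, 2 negative
        j = sum(r if s == 1 else -r if s == 2 else 0
                for r, s in enumerate(state, 1))
        a = sum(s != 0 for s in state)
        c = sum(s == 2 for s in state)
        counts[j, a, c] += 1
    return counts

def qstar_counts(b):
    """Counter keyed by (j, a, c): parts <= b used at most twice,
    a of them exactly once, c distinct integers used in all."""
    counts = Counter()
    for state in product((0, 1, 2), repeat=b):   # multiplicity of the part r
        j = sum(r * s for r, s in enumerate(state, 1))
        a = sum(s == 1 for s in state)
        c = sum(s != 0 for s in state)
        counts[j, a, c] += 1
    return counts

def relation_holds(b):
    q, qs = q_counts(b), qstar_counts(b)
    shift = b * (b + 1) // 2
    return (sum(q.values()) == sum(qs.values())
            and all(q[j - shift, b - a, b - c] == n for (j, a, c), n in qs.items()))

checked = all(relation_holds(b) for b in range(1, 7))
```

The correspondence behind the identity is visible in the enumeration: a part used once in Q* corresponds to an unused integer in Q, a part used twice to a positive part, and an unused integer to a negative part.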

If in Eq. (7) we put q = 1 and equate coefficients of $x^a t^c$, we obtain 

\[ \sum_{j'} Q(j'; a, b, c) = \binom{b}{a} \binom{a}{c}. \tag{10} \]

This result, as we shall see later, may be used to calculate the number of 
distinct weights belonging to shells of fundamental representations of the Lie 
algebras B(n), C(n), and D(n). 

In dealing with A(n - 1) in Section 2, we also summarize, without proofs, 
those properties of simple Lie algebras that are needed in order to calculate 
the layer multiplicities. The Lie algebras B(n), C(n), and D(n) are treated 
more briefly in Sections 3, 4, and 5, respectively. 


2. A(n - 1) 

In this section we consider the case of A(n - 1), which we treat in some 
detail in order to summarize well-known facts about simple Lie algebras. 
Facts valid for arbitrary simple Lie algebras will be given under the heading 
of “Property,” and their proofs will not be given here since they are readily 
available in the literature [6, 9]. We shall reserve the title of “Theorem” for 
properties directly related to partitions. The forms used here for the roots 
and weights of A(n - 1) will be derived, whereas in the following sections 
their forms for B(n), C(n), and D(n) will be written down without derivations; 
all the forms for the roots and weights used here are given in [6]. 

We shall use a realization of the $(n^2 - 1)$-dimensional Lie algebra 
A(n - 1) in terms of n x n matrices, choosing as its basis the traceless diagonal 
matrices $h_i$, i = 1, ..., n, where $h_i$ has (n - 1)/n as its (i, i)th element 
and -1/n for its other diagonal elements, and the $e_{jk}$, $j \ne k = 1, \ldots, n$, where 
$e_{jk}$ has 1 for its (j, k)th element and 0 for its other elements. The $h_i$, which 
span the Cartan subalgebra of A(n - 1), satisfy the relation $h_1 + \cdots + 
h_n = 0$, so only n - 1 of them are linearly independent. We find it convenient 
to use the above linearly dependent set as opposed to a linearly independent 
subset since they give rise to extremely simple forms for the roots. These 
forms will turn out to be n-tuples, like the forms for the roots of the other Lie 
algebras, and it is for this reason that we consider A(n - 1) rather than A(n). 

The commutation relations of the $h_i$, $e_{jk}$ are 

\[ [h_i, h_j] = 0, \qquad i, j = 1, \ldots, n, \tag{11} \]

\[ [h_i, e_{jk}] = (\delta_{ij} - \delta_{ik})\, e_{jk}, \qquad i = 1, \ldots, n, \tag{12} \]

and 

\[ [e_{jk}, e_{lm}] = \begin{cases}
h_j - h_k, & l = k, \ m = j \\
e_{jm}, & l = k, \ m \ne j \\
-e_{lk}, & l \ne k, \ m = j \\
0, & l \ne k, \ m \ne j.
\end{cases} \tag{13} \]

Using the shorthand notation $h = (h_1, \ldots, h_n)$, we may write (12) as 

\[ [h, e_{jk}] = \alpha_{jk}\, e_{jk} \tag{14} \]

and the first of Eqs. (13) as 

\[ [e_{jk}, e_{kj}] = \alpha_{jk} \cdot h \tag{15} \]

where 

\[ \alpha_{jk} = (0, \ldots, 0, \overset{j}{1}, 0, \ldots, 0, \overset{k}{-1}, 0, \ldots, 0) \equiv (\overset{j}{1}, \overset{k}{-1}). \tag{16} \]

The n(n - 1) n-tuples $\alpha_{jk}$ are the roots of A(n - 1). Clearly $\alpha_{kj} = -\alpha_{jk}$ and 
$\alpha_{jk} \cdot \alpha_{jk} = 2$; they also lie on the (n - 1)-dimensional hyperplane 

\[ x_1 + \cdots + x_n = 0, \]

so only n - 1 of them are linearly independent. We shall choose a basis for 
this hyperplane shortly. 
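
The commutation relations (11)-(13) can be checked mechanically in the matrix realization just described. The sketch below works in A(2), that is n = 3, with exact rational arithmetic; indices are 0-based in the code and all helper names are hypothetical.

```python
from fractions import Fraction

N = 3  # work in A(2); indices below are 0-based

def zero():
    return [[Fraction(0)] * N for _ in range(N)]

def e(j, k):
    """Matrix unit with 1 in position (j, k)."""
    m = zero()
    m[j][k] = Fraction(1)
    return m

def h(i):
    """Traceless diagonal matrix with (n-1)/n at (i, i) and -1/n elsewhere."""
    m = zero()
    for t in range(N):
        m[t][t] = Fraction(N - 1, N) if t == i else Fraction(-1, N)
    return m

def mul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def combo(a, b, c=-1):
    """Entrywise a + c*b (so the default is a - b)."""
    return [[x + c * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def scaled(c, a):
    return [[c * x for x in row] for row in a]

def bracket(a, b):
    return combo(mul(a, b), mul(b, a))

def relations_hold():
    idx = range(N)
    pairs = [(j, k) for j in idx for k in idx if j != k]
    ok = all(bracket(h(i), h(j)) == zero() for i in idx for j in idx)      # (11)
    ok &= all(bracket(h(i), e(j, k)) == scaled((i == j) - (i == k), e(j, k))
              for i in idx for (j, k) in pairs)                             # (12)
    for (j, k) in pairs:                                                    # (13)
        for (l, m) in pairs:
            if l == k and m == j:
                expected = combo(h(j), h(k))
            elif l == k:
                expected = e(j, m)
            elif m == j:
                expected = scaled(-1, e(l, k))
            else:
                expected = zero()
            ok &= bracket(e(j, k), e(l, m)) == expected
    return bool(ok)
```

Calling `relations_hold()` returns `True` for this realization; changing `N` repeats the check for other small values of n.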

Definition 1 (a) A real nonzero n-tuple L is called “positive” if its first 
nonzero component, reading from left to right, is positive; otherwise it is 
called “negative.” 

(b) If L and L' are two real n-tuples, L is said to be “higher” than L' if 
the n-tuple L - L' is positive. It is easy to see that $\alpha_{jk}$ is a positive root if 
j < k, and that if $\alpha_{jk}$ and $\alpha_{lm}$ are positive, $\alpha_{jk}$ is higher than $\alpha_{lm}$ if either j < l, 
or j = l and m < k. 

Definition 2 A positive root is called “simple” if it cannot be expressed 
as the sum of two positive roots. 

Property 1 The simple roots of a simple Lie algebra form a basis for the 
Lie algebra’s root space, the other roots being linear combinations with 
integral coefficients of the simple roots. 

Definition 3 The number of simple roots is called the “rank” of the Lie 
algebra. 






A(n - 1) has rank n - 1, its simple roots being $\alpha_i = \alpha_{i,i+1}$, i = 1, ..., 
n - 1, as can easily be verified. An arbitrary positive root $\alpha_{jk}$ is given in 
terms of them by 

\[ \alpha_{jk} = \sum_{i=j}^{k-1} \alpha_i. \tag{17} \]

Definition 4 The group generated by the reflections in hyperplanes 
perpendicular to the roots of a simple Lie algebra is called the “Weyl group” of 
the Lie algebra. 

Property 2 The Weyl group leaves the set of roots of a simple Lie 
algebra invariant. 

Let $L = (l_1, \ldots, l_n)$, and $S_{jk}L$ be the n-tuple obtained from L by reflecting 
it in the hyperplane perpendicular to $\alpha_{jk}$, so 

\[ S_{jk}L = L - \frac{2(\alpha_{jk} \cdot L)}{(\alpha_{jk} \cdot \alpha_{jk})}\, \alpha_{jk} = L - (\alpha_{jk} \cdot L)\, \alpha_{jk}. \tag{18} \]

Using Eq. (16) for $\alpha_{jk}$, we obtain 

\[ S_{jk}L = (l_1, \ldots, \overset{j}{l_k}, \ldots, \overset{k}{l_j}, \ldots, l_n), \]

so $S_{jk}$ just permutes the jth and kth components of L. The Weyl group for 
A(n - 1) therefore consists of all permutations of the components of L and 
so is isomorphic to the symmetric group S(n). This clearly leaves invariant 
the set of roots of A(n - 1), and in fact any two roots may be connected by at 
most two reflections: $\alpha_{lm} = S_{jl}S_{mk}\alpha_{jk}$. 
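
The action of the reflections is simple enough to check directly. The sketch below uses the general reflection formula on n-tuples, with hypothetical helper names `reflect` and `root`; positions are 1-based as in the text.

```python
from fractions import Fraction

def reflect(L, alpha):
    """Reflection of L in the hyperplane perpendicular to alpha."""
    def dot(u, v):
        return sum(a * b for a, b in zip(u, v))
    c = Fraction(2 * dot(alpha, L), dot(alpha, alpha))
    return tuple(l - c * a for l, a in zip(L, alpha))

def root(j, k, n):
    """The root with +1 in place j and -1 in place k (1-based)."""
    return tuple((i == j) - (i == k) for i in range(1, n + 1))

# S_24 interchanges the 2nd and 4th components of L.
swapped = reflect((5, 7, 9, 11), root(2, 4, 4))

# Two reflections carry the root (j, k) = (1, 2) to the root (l, m) = (3, 4).
step = reflect(root(1, 2, 5), root(4, 2, 5))     # S_mk with m = 4, k = 2
image = reflect(step, root(1, 3, 5))             # S_jl with j = 1, l = 3
```

Here `swapped` equals (5, 11, 9, 7) and `image` equals the root with +1 in place 3 and -1 in place 4.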

We now consider the classification of irreducible representations of 
A(n - 1). Let D be an N-dimensional irreducible representation of A(n - 1) 
with representatives 

\[ D(h_i) = H_i, \qquad D(e_{jk}) = E_{jk} \]

satisfying the hermiticity conditions 

\[ H_i^\dagger = H_i, \qquad E_{jk}^\dagger = E_{kj}. \tag{19} \]

These are precisely the conditions required in order that the exponential 
map of D yields a unitary representation of the unimodular unitary group 
SU(n). 

We denote the carrier space for D by R and choose its basis to consist of 
simultaneous eigenvectors $\psi_p$ of the $H_i$: 

\[ H_i\psi_p = \mu_{ip}\psi_p, \qquad i = 1, \ldots, n, \quad p = 1, \ldots, N. \]

Denoting $M_p = (\mu_{1p}, \ldots, \mu_{np})$ and $H = (H_1, \ldots, H_n)$, this equation may be 
written in the compact form 

\[ H\psi_p = M_p\psi_p. \tag{20} \]

$M_p$ is called a “weight” of the representation D; we denote the set of all 
weights of D by W(D). Because of the hermiticity of the $H_i$ (with respect to 
some inner product on R), elements of W(D) are real n-tuples; furthermore, 
from $H_1 + \cdots + H_n = 0$, we obtain $\mu_{1p} + \cdots + \mu_{np} = 0$, p = 1, ..., N, so 
elements of W(D) belong to the root hyperplane. 

Definition 5 The “multiplicity” of a weight $\Lambda \in W(D)$, denoted $\gamma_\Lambda$, is 
the number of linearly independent corresponding basis vectors $\psi_p \in R$. 

Definition 6 A weight $\Lambda$ is “simple” if $\gamma_\Lambda = 1$. 

We employ the same method of ordering weights as we did for arbitrary 
n-tuples, and thereby allow consideration of the highest weight of an 
irreducible representation D. 

Property 3 The highest weight of an irreducible representation of a 
simple Lie algebra is simple; and if two irreducible representations have the 
same highest weight, they are equivalent. 

The problem of classifying irreducible representations of a simple Lie 
algebra is therefore equivalent to the problem of classifying its highest 
weights. 

Property 4 Any two weights $\in W(D)$ differ by an integral linear 
combination of the roots and therefore also, by Property 1, by an integral linear 
combination of the simple roots. 

In particular, therefore, given the highest weight $\Lambda$ of W(D), any other 
weight $\in W(D)$ may be obtained by a suitable subtraction of simple roots 
from $\Lambda$. 

Property 5 If $M \in W(D)$ and $\alpha$ is a root of the Lie algebra, then 
$2(\alpha \cdot M)/(\alpha \cdot \alpha)$ is an integer, and $S_\alpha M = M - (2(\alpha \cdot M)/(\alpha \cdot \alpha))\alpha \in W(D)$. 
This implies that W(D) is invariant under the action of elements of the Weyl 
group. 

Definition 7 Two weights are called “equivalent” if they can be obtained 
from each other by an element of the Weyl group. The set of all 
weights equivalent to a given weight is called a “shell” of W(D). 

For A(n — 1), therefore, the shell containing a weight M consists of all 
n-tuples obtained by permuting the components of M. 






Property 6 If p is an integer such that $0 < p \le (\alpha \cdot M)/(\alpha \cdot \alpha)$, then: 

(a) $\gamma(M - p\alpha) = \gamma(S_\alpha M + p\alpha)$; 

(b) $\gamma(M - p\alpha) \ge \gamma(M - (p - 1)\alpha)$. 

This property implies that a chain of weights, $M, M - \alpha, \ldots, 
M - (2(\alpha \cdot M)/(\alpha \cdot \alpha))\alpha$, is spindle shaped, and in particular that equivalent 
weights have the same multiplicity. 

Definition 8 The highest weight in a shell is called a “dominant weight.” 
Clearly the highest weight A is itself a dominant weight. 

For the case of A(n - 1), let $M = (\mu_1, \ldots, \mu_n)$ be a dominant weight, and 
suppose $\mu_j < \mu_k$ for some j < k. Then 

\[ \begin{aligned}
S_{jk}M - M &= (\ldots, \overset{j}{\mu_k}, \ldots, \overset{k}{\mu_j}, \ldots) - (\ldots, \overset{j}{\mu_j}, \ldots, \overset{k}{\mu_k}, \ldots) \\
&= (0, \ldots, 0, \overset{j}{\mu_k - \mu_j}, 0, \ldots, 0, \overset{k}{\mu_j - \mu_k}, 0, \ldots, 0)
\end{aligned} \]

which is clearly positive; since $S_{jk}M$ is equivalent to M, this contradicts the 
assumption that M is a dominant weight. Hence if M is a dominant weight, 
its components satisfy the inequalities $\mu_1 \ge \mu_2 \ge \cdots \ge \mu_n$. 

Now let M be an arbitrary weight $\in W(D)$; by Property 5 applied to 
A(n - 1), $M \cdot \alpha_{jk}$ is an integer for all $\alpha_{jk}$, so $\mu_j - \mu_k$ is an integer for all 
j, k = 1, ..., n. This means that all components of M have the same fractional 
part, and also, by Property 4, that the components of all weights 
$\in W(D)$ have the same fractional part. 

The next property, due to Freudenthal [9], enables one to calculate the 
multiplicity of an arbitrary weight. 

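Property 7 Let $\alpha$ be a positive root of a simple Lie algebra, $\delta$ half the 
sum of all the positive roots, and $\Lambda$ and M the highest weight and an arbitrary 
weight, respectively, of W(D). Then 

\[ \gamma(M) = \frac{2 \sum_{\alpha > 0} \sum_{k=1}^{\infty} \gamma(M + k\alpha)\, ((M + k\alpha) \cdot \alpha)}{(\Lambda + \delta) \cdot (\Lambda + \delta) - (M + \delta) \cdot (M + \delta)} \tag{21} \]

where $\gamma(M + k\alpha) = 0$ if $M + k\alpha \notin W(D)$. 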
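
Freudenthal's formula (21) can be turned into a short recursive computation for A(n - 1). The following is a minimal sketch under stated assumptions, not an implementation taken from the paper: weights are n-tuples of rationals, the function name `weight_multiplicities` is hypothetical, and candidate weights are generated level by level by subtracting simple roots from weights already found. Candidates for which the denominator of (21) vanishes are skipped, since the formula gives no information there.

```python
from fractions import Fraction
from itertools import combinations

def weight_multiplicities(highest):
    """Multiplicities of the weights of D(Lambda) for A(n-1), using (21)."""
    n = len(highest)
    lam = tuple(Fraction(x) for x in highest)

    def root(j, k):
        return tuple(Fraction((i == j) - (i == k)) for i in range(n))

    def dot(u, v):
        return sum(a * b for a, b in zip(u, v))

    def shift(u, v, c):
        return tuple(a + c * b for a, b in zip(u, v))

    positive = [root(j, k) for j, k in combinations(range(n), 2)]
    simple = [root(i, i + 1) for i in range(n - 1)]
    delta = tuple(sum(r[i] for r in positive) / 2 for i in range(n))
    top = dot(shift(lam, delta, 1), shift(lam, delta, 1))

    mult = {lam: 1}
    level, depth = [lam], 0
    while level:
        depth += 1                      # simple roots subtracted so far
        candidates = {shift(w, a, -1) for w in level for a in simple}
        level = []
        for m in candidates:
            denom = top - dot(shift(m, delta, 1), shift(m, delta, 1))
            if denom == 0:
                continue
            # m + k*alpha lies above the highest weight once k > depth.
            total = sum(mult.get(shift(m, a, k), 0) * dot(shift(m, a, k), a)
                        for a in positive for k in range(1, depth + 1))
            g = 2 * total / denom
            if g:
                mult[m] = int(g)
                level.append(m)
    return mult

adjoint = weight_multiplicities((1, 0, -1))     # A(2), highest weight (1, 0, -1)
```

For this example the zero weight receives multiplicity 2 and the six roots multiplicity 1, giving total dimension 8.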

We now consider the classification of highest weights $\Lambda$ of a simple Lie 
algebra, denoting the irreducible representations corresponding to $\Lambda$ by 
$D(\Lambda)$, and $W(D(\Lambda))$ by $W(\Lambda)$. Now since $\Lambda$ lies in the root hyperplane, it is 
uniquely determined by the numbers $g_i = 2\Lambda \cdot \alpha_i / \alpha_i \cdot \alpha_i$, where $\alpha_i$ are the 
simple roots. Due to Property 5 and the fact that $\Lambda$ is a highest weight, these 
are all nonnegative integers. The numbers $g_i$ themselves therefore give a 
unique label for the irreducible representations. For A(n - 1) it is easy to 
show that if $\Lambda = (\lambda_1, \ldots, \lambda_n)$ is the highest weight of an irreducible 
representation, then $g_i = (\lambda_i - \lambda_{i+1})$, i = 1, ..., n - 1. 

Definition 9 A nonzero highest weight that cannot be expressed as the 
sum of two nonzero highest weights is called a “fundamental” highest 
weight, and the corresponding representation a “fundamental” 
representation. 

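Property 8 The fundamental highest weights are $\Lambda^i$, for which $g_j = \delta_{ij}$.
There are therefore as many fundamental highest weights as there are simple
roots, and they can therefore also be chosen as a basis for the root or weight
space; in fact if $\Lambda$ is any highest weight, then $\Lambda = \sum_i g_i \Lambda^i$.

For $A(n - 1)$ there are $n - 1$ fundamental weights and corresponding
representations; suppose $\Lambda^i = (\lambda_1, \ldots, \lambda_n)$, then since $\Lambda^i \cdot \alpha_j = \delta_{ij}$ we have
$\lambda_1 = \cdots = \lambda_i = \lambda_{i+1} + 1 = \cdots = \lambda_n + 1$, and since $\Lambda^i$ lies in the root
hyperplane, $\lambda_1 + \cdots + \lambda_n = 0$. Solving for $\lambda_1$ gives $\lambda_1 = (n - i)/n$, so

\[ \Lambda^i = \Bigl(\underbrace{\tfrac{n-i}{n}, \ldots, \tfrac{n-i}{n}}_{i}, \underbrace{-\tfrac{i}{n}, \ldots, -\tfrac{i}{n}}_{n-i}\Bigr). \]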

Let $N(\Lambda)$ be the dimension of $D(\Lambda)$; now since the simple roots are a
basis for the root space, we may write

\[ \Lambda = \sum_i l_i \alpha_i \tag{22} \]

where the sum extends over all simple roots ($i = 1, \ldots, n - 1$ for $A(n - 1)$
and $i = 1, \ldots, n$ for the other simple Lie algebras). In general, of course, the $l_i$
will differ from the $g_i$ above. Also, an arbitrary weight $M \in W(\Lambda)$ may be
written

\[ M = \sum_i m_i \alpha_i, \tag{23} \]

so

\[ \Lambda - M = \sum_i (l_i - m_i)\alpha_i, \]

which simply expresses the fact that $M$ may be obtained from $\Lambda$ by the
subtraction of a suitable number of simple roots. Clearly, then, $l_i - m_i$ must
be a nonnegative integer. Let

\[ \mu(\Lambda, M) = \sum_i (l_i - m_i), \tag{24} \]

so $\mu(\Lambda, M)$ is the total number of subtractions of simple roots needed to give
$M$ from $\Lambda$. Let $\Lambda'$ be the lowest weight $\in W(\Lambda)$; if $\Lambda = (\lambda_1, \lambda_2, \ldots, \lambda_n)$, then
for $A(n - 1)$ clearly $\Lambda' = (\lambda_n, \ldots, \lambda_2, \lambda_1)$.

Definition 10 The “height” $T(\Lambda)$ of $D(\Lambda)$ is defined by $T(\Lambda) = \mu(\Lambda, \Lambda')$.

Definition 11 A “layer” of $W(\Lambda)$ is a subset $\omega_j(\Lambda)$ consisting of all
$M \in W(\Lambda)$ such that $\mu(\Lambda, M) = j$.

$\omega_j(\Lambda)$ therefore consists of all $M$ obtainable from $\Lambda$ by the subtraction of
$j$ (not necessarily distinct) simple roots. Clearly the number of layers of
$W(\Lambda)$ is $T(\Lambda) + 1$, and $W(\Lambda) = \bigcup_{j=0}^{T(\Lambda)} \omega_j(\Lambda)$.

Let $s_j(\Lambda)$ denote the number of weights $\in \omega_j(\Lambda)$, in which each weight is
counted a number of times equal to its multiplicity. Then $N(\Lambda) =
\sum_{j=0}^{T(\Lambda)} s_j(\Lambda)$, and because of the simplicity of $\Lambda$ and $\Lambda'$, we have
$s_0(\Lambda) = s_{T(\Lambda)}(\Lambda) = 1$.

Property 9 The layers of $W(\Lambda)$ are spindle shaped, i.e., $s_j(\Lambda) =
s_{T(\Lambda)-j}(\Lambda)$ and $s_0(\Lambda) \le s_1(\Lambda) \le \cdots \le s_r(\Lambda)$ where $r = [\tfrac{1}{2}T(\Lambda)]$ (the largest
integer $\le \tfrac{1}{2}T(\Lambda)$).

It is this property that we shall apply to the fundamental representations 
to derive our theorems on various types of partitions of integers. 

We now consider the case of $A(n - 1)$ and show that $s_j(\Lambda^i)$ equals the
number $(j; n - i, i)$ of partitions of $j$ into at most $n - i$ positive integers none of which
exceeds $i$. Now

\[ \Lambda^i = (\underbrace{a, \ldots, a}_{i}, \underbrace{a - 1, \ldots, a - 1}_{n-i}) \quad \text{where } a = (n - i)/n, \]

and the only simple root that may be subtracted from it is $\alpha_i$, and this may
be subtracted only once to give

\[ \Lambda^i - \alpha_i = (a, \ldots, a, a - 1, a, a - 1, \ldots, a - 1), \]

in which the first $a - 1$ stands in the $i$th position and the last $a$ in the
$(i + 1)$th position; this weight is equivalent to $\Lambda^i$. Next, the only simple roots that one may subtract
(again just once) from $\Lambda^i - \alpha_i$ are $\alpha_{i \pm 1}$, again to yield weights equivalent to
$\Lambda^i$. Repeating this process it is easy to see that if $M \in W(\Lambda^i)$, the only
permissible subtraction of a simple root from it is one which interchanges an
$a - 1$ with an $a$ immediately preceding it, this being achieved by a single
subtraction. This shows that every weight $\in W(\Lambda^i)$ is equivalent to $\Lambda^i$, and
therefore simple. The dimension of $W(\Lambda^i)$ therefore equals the number of
distinct permutations of the components of $\Lambda^i$, and this equals the number of
ways of choosing $i$ components to be $a$, i.e.,

\[ N(\Lambda^i) = \binom{n}{i}. \tag{25} \]

Next, $T(\Lambda^i)$ equals the number of subtractions of a simple root from $\Lambda^i$ to
obtain the lowest weight

\[ \Lambda'^i = (\underbrace{a - 1, \ldots, a - 1}_{n-i}, \underbrace{a, \ldots, a}_{i}). \]

Clearly, $\Lambda'^i$ is obtained from $\Lambda^i$ by $i$ interchanges of the first $a - 1$ (reading
from the left) with the preceding $a$, followed by $i$ similar interchanges of the
second $a - 1$, and so on. Each step in the process involves the subtraction of
a single simple root, so since there are $n - i$ $(a - 1)$’s, we have

\[ T(\Lambda^i) = i(n - i). \tag{26} \]

Finally, we calculate $s_j(\Lambda^i)$; since each weight of $W(\Lambda^i)$ is simple, this
equals the number of weights $\in \omega_j(\Lambda^i)$. We must therefore calculate the
number of distinct ways of subtracting $j$ simple roots from $\Lambda^i$, in other words
the number of distinct ways of successively interchanging $j$ $(a - 1)$’s with
preceding $a$’s. Suppose, for $l = 1, \ldots, n - i$, the $l$th $a - 1$ undergoes $k_l$ such
successive interchanges; clearly the $(l + 1)$th $a - 1$ cannot have more such
interchanges than the $l$th $a - 1$, and since there are $i$ $a$’s, no $a - 1$ may
undergo more than $i$ interchanges. $s_j(\Lambda^i)$ is therefore the number of distinct
solutions of the equation $k_1 + \cdots + k_{n-i} = j$ subject to the conditions
$k_1 \ge k_2 \ge \cdots \ge k_{n-i}$ and $k_l \le i$ for $l = 1, \ldots, n - i$. But this is just the
partition function $(j; n - i, i)$; hence

\[ s_j(\Lambda^i) = (j; n - i, i). \tag{27} \]

Equations (25) and (26) and Property 9 now imply the following properties
of $(j; n - i, i)$:

\[ \sum_{j=0}^{i(n-i)} (j; n - i, i) = \binom{n}{i}, \]

which is just the Eq. (4) trivially derived using generating functions in
Section 1, and

Theorem 1

(a) $(j; n - i, i) = (i(n - i) - j; n - i, i)$;

(b) $(j; n - i, i) \ge (j - 1; n - i, i)$ for $0 < j \le [i(n - i)/2]$.

Theorem 1a is, as mentioned in Section 1, merely a statement of the reciprocal
nature of the Gaussian polynomial $\begin{bmatrix} n \\ i \end{bmatrix}(q)$. Theorem 1b, on the other hand,
is not trivial, and although it was in fact proved a long time ago by Elliott [5]
using invariant theory of quantics, it does not appear to be very widely
known.
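Theorem 1 and the dimension formula (25) are easy to check numerically for small $n$. The sketch below counts the partitions $(j; n - i, i)$ by a simple recursion; the function names are ours, and the recursion (either no part equals the largest allowed value, or one such part is removed) is a standard device rather than anything taken from the paper.

```python
from functools import lru_cache
from math import comb


@lru_cache(maxsize=None)
def bounded_partitions(j, parts, largest):
    """Number of partitions of j into at most `parts` positive integers,
    none exceeding `largest`; this is (j; parts, largest) in the text."""
    if j == 0:
        return 1
    if j < 0 or parts == 0 or largest == 0:
        return 0
    # Either no part equals `largest`, or remove one part equal to `largest`.
    return (bounded_partitions(j, parts, largest - 1)
            + bounded_partitions(j - largest, parts - 1, largest))


def layer_sizes(n, i):
    """s_j(Lambda^i) for A(n-1), j = 0, ..., i(n-i), via Eq. (27)."""
    return [bounded_partitions(j, n - i, i) for j in range(i * (n - i) + 1)]


def is_spindle(seq):
    """True if seq is symmetric and nondecreasing up to its middle."""
    half = (len(seq) - 1) // 2
    return seq == seq[::-1] and all(seq[j - 1] <= seq[j]
                                    for j in range(1, half + 1))


sizes = layer_sizes(4, 2)
print(sizes, sum(sizes) == comb(4, 2), is_spindle(sizes))
```

For $n = 4$, $i = 2$ the layer sizes are 1, 1, 2, 1, 1, which sum to $\binom{4}{2} = 6$ and are spindle shaped.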


3. B(n) 

Having in the preceding section given the general method for deriving 
the simple roots and fundamental weights of a Lie algebra, we shall content 
ourselves in this and the following sections with writing them down without 
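derivation (these can all be found in [6]). $B(n)$ is the Lie algebra of the group
$O(2n + 1)$ of $(2n + 1) \times (2n + 1)$ orthogonal matrices, and has rank $n$. Its
positive roots are $n$-tuples of the form

\[ (0, \ldots, 0, 1, 0, \ldots, 0, \pm 1, 0, \ldots, 0) \quad \text{and} \quad (0, \ldots, 0, 1, 0, \ldots, 0), \]

with the nonzero entries of the first type in the $j$th and $k$th positions and
that of the second type in the $j$th position, the simple roots being

\[ \alpha_i = (0, \ldots, 0, 1, -1, 0, \ldots, 0) \quad \text{for } i = 1, \ldots, n - 1, \]

with the entries $1, -1$ in the $i$th and $(i + 1)$th positions,
and $\alpha_n = (0, \ldots, 0, 1)$, so $2(\alpha_i \cdot L)/(\alpha_i \cdot \alpha_i) = \alpha_i \cdot L$, $i = 1, \ldots, n - 1$, and
$2(\alpha_n \cdot L)/(\alpha_n \cdot \alpha_n) = 2(\alpha_n \cdot L)$. Using $2(\Lambda^i \cdot \alpha_j)/(\alpha_j \cdot \alpha_j) = \delta_{ij}$, we find the
fundamental highest weights to be

\[ \Lambda^i = (\underbrace{1, \ldots, 1}_{i}, 0, \ldots, 0), \quad i = 1, \ldots, n - 1, \quad \text{and} \quad \Lambda^n = (\tfrac{1}{2}, \ldots, \tfrac{1}{2}). \]

We treat the $n$th fundamental representation separately from the rest, dealing
with it first.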

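By Property 5, the only simple root that may be subtracted from $\Lambda^n$ is $\alpha_n$,
this being subtractible only once to give $\Lambda^n - \alpha_n = (\tfrac{1}{2}, \ldots, \tfrac{1}{2}, -\tfrac{1}{2})$ which is
equivalent to $\Lambda^n$. $\alpha_{n-1}$ may be subtracted once from this to give the weight
$(\tfrac{1}{2}, \ldots, \tfrac{1}{2}, -\tfrac{1}{2}, \tfrac{1}{2})$, and from this, one may either subtract $\alpha_{n-2}$ to give
$(\tfrac{1}{2}, \ldots, \tfrac{1}{2}, -\tfrac{1}{2}, \tfrac{1}{2}, \tfrac{1}{2})$, or $\alpha_n$ again to give $(\tfrac{1}{2}, \ldots, \tfrac{1}{2}, -\tfrac{1}{2}, -\tfrac{1}{2})$. It is easy to see, in fact, that
an arbitrary weight $M \in W(\Lambda^n)$ has either $\tfrac{1}{2}$ or $-\tfrac{1}{2}$ for its components, a
subtraction of a simple root being equivalent to either interchanging a $-\tfrac{1}{2}$
with an immediately preceding $\tfrac{1}{2}$ or to changing an $n$th $\tfrac{1}{2}$ into a $-\tfrac{1}{2}$. At no
stage can a simple root be subtracted more than once, so each weight is
equivalent to $\Lambda^n$ and therefore simple. Clearly, the lowest weight is
$\Lambda'^n = (-\tfrac{1}{2}, \ldots, -\tfrac{1}{2})$.

Now the number of weights with $l$ $-\tfrac{1}{2}$’s is $\binom{n}{l}$, so the dimension of $D(\Lambda^n)$
is given by

\[ N(\Lambda^n) = \sum_{l=0}^{n} \binom{n}{l} = 2^n. \tag{28} \]

Also, in order to obtain $\Lambda'^n$ from $\Lambda^n$ one must change the $n$th $\tfrac{1}{2}$ into a $-\tfrac{1}{2}$ and
then permute it step by step with the immediately preceding $\tfrac{1}{2}$ until one
obtains, after $n$ simple subtractions, $(-\tfrac{1}{2}, \tfrac{1}{2}, \ldots, \tfrac{1}{2})$. Repeating the process for
this weight one obtains, after a further $n - 1$ simple subtractions, $(-\tfrac{1}{2}, -\tfrac{1}{2}, \tfrac{1}{2},
\ldots, \tfrac{1}{2})$ and so on. By the time $\Lambda'^n$ is attained, the total number of simple
subtractions undergone is $n + (n - 1) + \cdots + 2 + 1$, so

\[ T(\Lambda^n) = \tfrac{1}{2}n(n + 1). \tag{29} \]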

Next, $s_j(\Lambda^n)$ is the number of distinct ways of subtracting $j$ simple roots
from $\Lambda^n$; suppose, for some $M \in \omega_j(\Lambda^n)$, the $l$th $-\tfrac{1}{2}$ is in the $(n - k_l + 1)$th
position, where the $-\tfrac{1}{2}$’s are numbered from the left and we interpret a $-\tfrac{1}{2}$
in the $(n + 1)$th position as a $\tfrac{1}{2}$ in the $n$th position. The replacement of a $\tfrac{1}{2}$ in
this position by the $-\tfrac{1}{2}$ is achieved by $k_l$ simple subtractions. Clearly, no $k_l$
can exceed $n$, and $k_l > k_{l+1}$ unless they are both zero; therefore $j = k_1 +
\cdots + k_n$, and

\[ s_j(\Lambda^n) = P(j; n, n), \tag{30} \]

where $P(j; n, n)$ is the number of partitions of $j$ into at most $n$ strictly
decreasing integers, none of which exceeds $n$.

Equations (29) and (30) now imply that

\[ \sum_{j=0}^{n(n+1)/2} P(j; n, n) = 2^n, \]

which is just Eq. (6) derived using the generating function of $P(j; n, n)$; and
Property 9 implies:

Theorem 2

(a) $P(j; n, n) = P(\tfrac{1}{2}n(n + 1) - j; n, n)$;

(b) $P(j; n, n) \ge P(j - 1; n, n)$ for $j \le [\tfrac{1}{4}n(n + 1)]$.

(a) is, again, merely a statement of the reciprocal nature of the generating
function for $P(j; n, n)$; (b) appears to be new.
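Theorem 2 can likewise be checked by machine for small $n$. The sketch below assumes the standard generating function $\prod_{m=1}^{n}(1 + q^m)$ for partitions into distinct parts not exceeding $n$ (the paper refers to a generating function in Section 1, which is not reproduced here); the function name is ours.

```python
def distinct_partition_counts(n):
    """Coefficients of prod_{m=1}^{n} (1 + q^m); entry j is P(j; n, n),
    the number of partitions of j into distinct parts none exceeding n."""
    coeffs = [1]
    for m in range(1, n + 1):
        new = coeffs + [0] * m          # terms that do not use the part m
        for j, c in enumerate(coeffs):  # terms that do use the part m
            new[j + m] += c
        coeffs = new
    return coeffs


for n in range(1, 9):
    p = distinct_partition_counts(n)
    half = n * (n + 1) // 4
    symmetric = p == p[::-1]
    rising = all(p[j - 1] <= p[j] for j in range(1, half + 1))
    print(n, sum(p) == 2 ** n, symmetric, rising)
```

Each line of the output reports, for one value of $n$, whether the sum equals $2^n$, whether the sequence is symmetric (Theorem 2a), and whether it is nondecreasing up to $[\tfrac{1}{4}n(n + 1)]$ (Theorem 2b).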

We now consider the general case of $\Lambda^i$, $i \ne n$. By the subtraction of
simple roots $\alpha_i$, $i \ne n$, we obtain all weights whose components are permutations
of those of $\Lambda^i$, and so which have $i$ components equal to 1 and $n - i$
components equal to zero. These weights are all equivalent to $\Lambda^i$ since at any
given time a given $\alpha_i$ may be subtracted only once. For any such weight with
1 for its $n$th component we may also subtract $\alpha_n$ either once to give a weight
with 0 for its $n$th component, or twice to give a weight with $-1$ for its $n$th
component. The latter weight is equivalent to the original weight and therefore
also to $\Lambda^i$, whereas the former weight is not.

Proceeding in this manner we see that $W(\Lambda^i)$ consists of all weights with
a number $\le i$ of components equal to $\pm 1$, the rest of the components being
zero. For any such weight $\alpha_i$ may be subtracted once if the $i$th component is
1 and the $(i + 1)$th component is zero or if the $i$th component is zero and the
$(i + 1)$th component is $-1$, but if the $i$th component is 1 and the $(i + 1)$th
component is $-1$, $\alpha_i$ may be subtracted either once to give a weight with two
more zero components or twice to give a weight with the 1 and $-1$ interchanged.
If the $n$th component is 1, $\alpha_n$ may be subtracted either once to give a
weight with one more zero component or twice to give a weight with the $n$th
component equal to $-1$. Weights with the same number of $\pm 1$’s are equivalent
and so belong to the same shell. Let $G_k$ denote the shell consisting of
weights with $i - k$ components equal to $\pm 1$ and $n - i + k$ zero components;
clearly

\[ W(\Lambda^i) = \bigcup_{k=0}^{i} G_k. \tag{31} \]





The lowest weight

\[ \Lambda'^i = (\underbrace{-1, \ldots, -1}_{i}, 0, \ldots, 0) \in G_0, \]

and the dominant weight of $G_k$ is

\[ M_k = (\underbrace{1, \ldots, 1}_{i-k}, 0, \ldots, 0). \]

We use $M_k$ to calculate the multiplicity of weights $\in G_k$, using Freudenthal’s
formula, Eq. (21), and denoting $\gamma(M_k)$ by $N_k$ for short. First, using
$\delta = \tfrac{1}{2}(2n - 1, 2n - 3, \ldots, 3, 1)$ we obtain $(\Lambda^i + \delta) \cdot (\Lambda^i + \delta) - (M_k + \delta) \cdot
(M_k + \delta) = k(2n - 2i + k + 1)$. Secondly, $n - i + k$ positive roots of the
type $(0, \ldots, 0, 1, 0, \ldots, 0)$ may be added to $M_k$ once only to give a weight of
$G_{k-1}$, whereas $(n - i + k)(n - i + k - 1)$ positive roots of the type $(0, \ldots, 0,
1, 0, \ldots, 0, \pm 1, 0, \ldots, 0)$ may be added once only to give a weight of $G_{k-2}$. In
each such case, $(M_k + \alpha) \cdot \alpha = (\alpha \cdot \alpha)$, so substituting into Eq. (21) we
obtain the recursion relation

\[ k(2n - 2i + k + 1)N_k = 2(n - i + k)\bigl(2(n - i + k - 1)N_{k-2} + N_{k-1}\bigr), \quad k \ge 1, \]

where $N_{-1} = 0$. Using $N_0 = 1$ we obtain easily $N_1 = 1$; the solution of the
recursion relation subject to these conditions is

\[ N_{2k} = \binom{n - i + 2k}{k}, \qquad N_{2k+1} = \binom{n - i + 2k + 1}{k}. \tag{32} \]

Consider $G_k$; weights in this shell have $(n - i + k)$ zero components
which may be in any of $\binom{n}{i-k}$ distinct arrangements. For any one of these
arrangements, suppose there are $j$ 1’s and $(i - k - j)$ $-1$’s; there are $\binom{i-k}{j}$
such arrangements of the $\pm 1$’s, so the total number of weights with
$(n - i + k)$ 0’s in any given arrangement is

\[ \sum_{j=0}^{i-k} \binom{i - k}{j} = 2^{i-k}. \]

Hence the number of distinct weights in $G_k$ is $\binom{n}{i-k} 2^{i-k}$ and the multiplicity of
$G_k$ is $N_k \binom{n}{i-k} 2^{i-k}$. Summing over all shells we obtain the dimension of $D(\Lambda^i)$;
this is given in Table 30 of [6] as $\binom{2n+1}{i}$, which is therefore the value of the
above sum:

\[ \sum_{k=0}^{i} N_k \binom{n}{i - k} 2^{i-k} = \binom{2n + 1}{i}. \tag{33} \]
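The recursion relation for $N_k$, its closed-form solution, and the dimension sum over shells can all be checked numerically. The sketch below is ours (function names included); it writes the solution of the recursion compactly as $N_k = \binom{n-i+k}{[k/2]}$, which covers both the even and the odd case, and uses exact rational arithmetic for the recursion.

```python
from fractions import Fraction
from math import comb


def shell_multiplicities(n, i):
    """N_0, ..., N_i for B(n) from the recursion
    k(2n-2i+k+1) N_k = 2(n-i+k)(2(n-i+k-1) N_{k-2} + N_{k-1}),
    with N_{-1} = 0 and N_0 = 1."""
    N = [Fraction(1)]
    for k in range(1, i + 1):
        prev2 = N[k - 2] if k >= 2 else Fraction(0)
        rhs = 2 * (n - i + k) * (2 * (n - i + k - 1) * prev2 + N[k - 1])
        N.append(rhs / (k * (2 * n - 2 * i + k + 1)))
    return N


def closed_form(n, i, k):
    """Closed form of N_k: binom(n-i+k, floor(k/2))."""
    return comb(n - i + k, k // 2)


def dimension_B(n, i):
    """Sum over shells of N_k * binom(n, i-k) * 2^(i-k)."""
    return sum(closed_form(n, i, k) * comb(n, i - k) * 2 ** (i - k)
               for k in range(i + 1))


n, i = 5, 3
print([int(x) for x in shell_multiplicities(n, i)],
      dimension_B(n, i), comb(2 * n + 1, i))
```

For $n = 5$, $i = 3$ the recursion gives $N_0, \ldots, N_3 = 1, 1, 4, 5$, and the shell sum equals $\binom{11}{3} = 165$.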

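We next calculate the height of $D(\Lambda^i)$; let

\[ \Lambda^i_j = (\underbrace{-1, \ldots, -1}_{j}, \underbrace{1, \ldots, 1}_{i-j}, 0, \ldots, 0) \]

so that $\Lambda^i = \Lambda^i_0$, $\Lambda'^i = \Lambda^i_i$; now

\[ \Lambda^i_j = \Lambda^i_{j-1} - 2(0, \ldots, 0, 1, 0, \ldots, 0) = \Lambda^i_{j-1} - 2(\alpha_j + \cdots + \alpha_n), \]

where the 1 stands in the $j$th position,
so $\Lambda^i_j$ is $2(n - j + 1)$ layers down from $\Lambda^i_{j-1}$. Hence $\Lambda^i_i$ is $2(n + (n - 1) +
\cdots + (n - i + 1))$ layers down from $\Lambda^i_0$, i.e.,

\[ T(\Lambda^i) = i(2n - i + 1). \tag{34} \]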

We now consider an arbitrary weight of $G_k$ and determine to which layer
it belongs by calculating how many layers down it is from $M_k$. $M_k$ itself can
easily be shown, in a manner analogous to that used to calculate $T(\Lambda^i)$, to
belong to $\omega_{k(2n-2i+k+1)/2}(\Lambda^i)$. Let $M_k(l_1, \ldots, l_{i-k})$ be the weight $\in G_k$ with
$i - k$ 1’s in which, for $\lambda = 1, \ldots, i - k$, the $\lambda$th 1 is, reading from the left, in
the $l_\lambda$th position (so $1 \le l_1 < l_2 < \cdots < l_{i-k} \le n$). To get to this weight from
$M_k$, the $\lambda$th 1 must be moved forward $l_\lambda - \lambda$ places, each such movement by
one place being achieved by a single subtraction of a simple root. $M_k(l_1, \ldots,
l_{i-k})$ is therefore

\[ \sum_{\lambda=1}^{i-k} (l_\lambda - \lambda) = \sum_{\lambda=1}^{i-k} l_\lambda - \tfrac{1}{2}(i - k)(i - k + 1) \]

layers down from $M_k$.

Next, let $M_k^{m_1, \ldots, m_h}(l_1, \ldots, l_{i-k})$ be the weight with $(i - k)$ $\pm 1$’s of which $h$ ($0 \le h \le
i - k$) are $-1$’s such that the 0’s are in the same position as in $M_k(l_1, \ldots,
l_{i-k})$, and such that, for $\mu = 1, \ldots, h$, the $\mu$th $-1$ is in the $l_{m_\mu}$th position,
where $1 \le m_1 < m_2 < \cdots < m_h \le i - k$. Now since it takes $2(n - l_{m_\mu} + 1)$
simple subtractions to replace 1 in the $l_{m_\mu}$th position by a $-1$, to get from
$M_k(l_1, \ldots, l_{i-k})$ to $M_k^{m_1, \ldots, m_h}(l_1, \ldots, l_{i-k})$ requires

\[ 2 \sum_{\mu=1}^{h} (n - l_{m_\mu} + 1) = 2h(n + 1) - 2 \sum_{\mu=1}^{h} l_{m_\mu} \]

simple subtractions. The number of simple subtractions needed to get from
$M_k$ to $M_k^{m_1, \ldots, m_h}(l_1, \ldots, l_{i-k})$ is therefore

\[ \sum_{\lambda=1}^{i-k} l_\lambda - 2 \sum_{\mu=1}^{h} l_{m_\mu} - \tfrac{1}{2}(i - k)(i - k + 1) + 2h(n + 1)
 = \sum_{\lambda=1}^{i-k} \varepsilon_\lambda l_\lambda - \tfrac{1}{2}(i - k)(i - k + 1) + 2h(n + 1) \]

where $h$ of the $\varepsilon_\lambda$ are $-1$ and the rest are $+1$. Finally, therefore,
$M_k^{m_1, \ldots, m_h}(l_1, \ldots, l_{i-k})$ is on $\omega_J$, where

\[ J = \sum_{\lambda=1}^{i-k} \varepsilon_\lambda l_\lambda + (n + 1)(2h + k) - \tfrac{1}{2}i(i + 1). \]
A= 1 
Let $Q(f; a, b, c)$ be the number of partitions of $f$ into exactly $a$ strictly
increasing positive integers none of which exceeds $b$ and in which $c$ of the
integers contribute negatively. From the above it is easy to see that

\[ s_j(\Lambda^i) = \sum_{k=0}^{i} N_k \sum_{h=0}^{i-k} Q\bigl(j - (n + 1)(2h + k) + \tfrac{1}{2}i(i + 1);\ i - k, n, h\bigr). \tag{35} \]

Using Eq. (34) we see that

\[ \sum_{j=0}^{i(2n-i+1)} \sum_{h=0}^{i-k} Q\bigl(j - (n + 1)(2h + k) + \tfrac{1}{2}i(i + 1);\ i - k, n, h\bigr) \]

is the number of distinct weights $\in G_k$. We have already seen that this
number is $\binom{n}{i-k} 2^{i-k}$, and the equality of these two expressions can easily be
shown using Eq. (8), with the appropriate values for $f$, $a$, $b$, and $c$ ($f_{\min}$ and
$f_{\max}$ are easily seen to correspond to $j = 0$ and $j = i(2n - i + 1)$, respectively).
Clearly, Eq. (10) could have been used to derive the formula for the number
of distinct weights of $G_k$, but no great advantage would have ensued from
doing so.

Property 9 of the last section shows the following to hold: 

Theorem 3 For $i = 1, \ldots, n - 1$,

(a) \[ \sum_{k=0}^{i} N_k \sum_{h=0}^{i-k} Q\bigl(j - (n + 1)(2h + k) + \tfrac{1}{2}i(i + 1);\ i - k, n, h\bigr)
 = \sum_{k=0}^{i} N_k \sum_{h=0}^{i-k} Q\bigl(i(2n - i + 1) - j - (n + 1)(2h + k) + \tfrac{1}{2}i(i + 1);\ i - k, n, h\bigr). \]

(b) \[ \sum_{k=0}^{i} N_k \sum_{h=0}^{i-k} \Bigl\{ Q\bigl(j - (n + 1)(2h + k) + \tfrac{1}{2}i(i + 1);\ i - k, n, h\bigr)
 - Q\bigl(j - 1 - (n + 1)(2h + k) + \tfrac{1}{2}i(i + 1);\ i - k, n, h\bigr) \Bigr\} \ge 0 \]

for $0 < j \le \tfrac{1}{2}i(2n - i + 1)$.

As mentioned in Section 1, (a) follows easily from the property
$Q(f; a, b, c) = Q(-f; a, b, a - c)$ when the appropriate values of $f$, $a$, $b$,
and $c$ are taken, and in fact the equation holds with the $\sum_{k=0}^{i} N_k$ omitted so
that the shells themselves have a symmetric layer structure. This is a fact that
does not follow from Dynkin’s theorem; the author does not know whether
it is generally known. No previous proof of (b) appears to have been given.
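For small ranks, Theorem 3 can be checked by brute force. The following sketch is ours: it evaluates $Q(f; a, b, c)$ by direct enumeration of signed subsets, uses the closed form $N_k = \binom{n-i+k}{[k/2]}$ for the shell multiplicities, and builds the layer sizes of $D(\Lambda^i)$ for $B(n)$ from Eq. (35). Enumeration is exponential in $n$, so this is meant only for small cases.

```python
from itertools import combinations
from math import comb


def Q(f, a, b, c):
    """Number of ways to write f as a signed sum of exactly a distinct
    integers from 1..b, exactly c of which enter with a minus sign."""
    count = 0
    for parts in combinations(range(1, b + 1), a):
        total = sum(parts)
        for negs in combinations(parts, c):
            if total - 2 * sum(negs) == f:
                count += 1
    return count


def layer_sizes_B(n, i):
    """s_j(Lambda^i) for B(n), 1 <= i <= n-1, j = 0..i(2n-i+1), via Eq. (35)."""
    height = i * (2 * n - i + 1)
    half_sum = i * (i + 1) // 2
    sizes = []
    for j in range(height + 1):
        s = 0
        for k in range(i + 1):
            N_k = comb(n - i + k, k // 2)  # shell multiplicity
            for h in range(i - k + 1):
                s += N_k * Q(j - (n + 1) * (2 * h + k) + half_sum, i - k, n, h)
        sizes.append(s)
    return sizes


sizes = layer_sizes_B(3, 2)
print(sizes, sum(sizes) == comb(7, 2), sizes == sizes[::-1])
```

The printed list is the sequence $s_0, \ldots, s_{T}$ for $B(3)$, $i = 2$; its sum should equal the dimension $\binom{7}{2} = 21$, and symmetry corresponds to part (a) of the theorem.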

4. C(n) 

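$C(n)$ is the Lie algebra of the group $Sp(2n)$ of symplectic matrices of order
$2n$, and again has rank $n$. Its positive roots are again $n$-tuples of the form

\[ (0, \ldots, 0, 1, 0, \ldots, 0, \pm 1, 0, \ldots, 0) \]

(like $B(n)$), with the nonzero entries in the $j$th and $k$th positions, and (unlike $B(n)$)

\[ (0, \ldots, 0, 2, 0, \ldots, 0), \]

the simple roots being

\[ \alpha_i = (0, \ldots, 0, 1, -1, 0, \ldots, 0) \quad \text{and} \quad \alpha_n = (0, \ldots, 0, 2), \]

where the entries $1, -1$ of $\alpha_i$ stand in the $i$th and $(i + 1)$th positions;
here $2(\alpha_n \cdot L)/(\alpha_n \cdot \alpha_n) = \tfrac{1}{2}(\alpha_n \cdot L)$. Using $2(\Lambda^i \cdot \alpha_j)/(\alpha_j \cdot \alpha_j) = \delta_{ij}$ we find the
fundamental highest weights to be

\[ \Lambda^i = (\underbrace{1, \ldots, 1}_{i}, 0, \ldots, 0) \]

for $i = 1, \ldots, n$.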

The analysis of $D(\Lambda^i)$, $i = 1, \ldots, n$, is very similar to that of $D(\Lambda^i)$, $i = 1,
\ldots, n - 1$ of $B(n)$, the weights again having either $\pm 1$ or 0 for their components.
However, $\alpha_n$ may be subtracted only once from a weight with 1 as its
$n$th component, to give a weight with $-1$ for its $n$th component, so a weight
cannot have its number of zero components increased by one. Whenever a
weight has a 1 immediately preceding a $-1$, it is still possible to subtract an
$\alpha_i$, $i \ne n$, twice, one subtraction resulting in a weight with two more zero
components. The shell structure of $W(\Lambda^i)$ is therefore similar to the case of
$B(n)$, except that only the $G_{2k}$, $k = 0, \ldots, [i/2]$, occur, i.e.,

\[ W(\Lambda^i) = \bigcup_{k=0}^{[i/2]} G_{2k}. \tag{36} \]

Using the dominant weight

\[ M_{2k} = (\underbrace{1, \ldots, 1}_{i-2k}, 0, \ldots, 0) \]

to calculate the multiplicity of weights of $G_{2k}$, and the expression $\delta = (n,
n - 1, \ldots, 2, 1)$, we find that this multiplicity is $((n - i + 1)/(n - i +
k + 1))N_{2k}$, where $N_{2k}$ is as given in Eq. (32). This implies that $G_{2k}$ has total
dimension

\[ \frac{n - i + 1}{n - i + k + 1} \binom{n - i + 2k}{k} \binom{n}{i - 2k} 2^{i-2k}. \]

The total dimension of $D(\Lambda^i)$ is therefore

\[ N(\Lambda^i) = \sum_{k=0}^{[i/2]} \frac{n - i + 1}{n - i + k + 1} \binom{n - i + 2k}{k} \binom{n}{i - 2k} 2^{i-2k}. \tag{37} \]

Letting

\[ \Lambda^i_j = (\underbrace{-1, \ldots, -1}_{j}, \underbrace{1, \ldots, 1}_{i-j}, 0, \ldots, 0), \]

we find that for $C(n)$, $\Lambda^i_j = \Lambda^i_{j-1} - 2(\alpha_j + \cdots + \alpha_{n-1}) - \alpha_n$, so $\Lambda^i_j$ is
$\{2(n - j) + 1\}$ layers down from $\Lambda^i_{j-1}$; using this we find that the height of
$W(\Lambda^i)$ is given by

\[ T(\Lambda^i) = i(2n - i). \tag{38} \]

With the same notation as in the preceding section, and by highly similar
methods, it can easily be proved that an arbitrary weight of the type
$M_{2k}^{m_1, \ldots, m_h}(l_1, \ldots, l_{i-2k})$, where $k = 0, \ldots, [i/2]$ and $h = 0, \ldots, i - 2k$, is on the layer $\omega_J$,
where now

\[ J = \sum_{\lambda=1}^{i-2k} \varepsilon_\lambda l_\lambda + (2n + 1)(h + k) - \tfrac{1}{2}i(i + 1) \]

and hence

\[ s_j(\Lambda^i) = \sum_{k=0}^{[i/2]} \frac{n - i + 1}{n - i + k + 1} N_{2k} \sum_{h=0}^{i-2k} Q\bigl(j - (2n + 1)(h + k) + \tfrac{1}{2}i(i + 1);\ i - 2k, n, h\bigr). \]

Property 9 of Section 2 implies the following:

Theorem 4 For $i = 1, \ldots, n$,

(a) \[ \sum_{k=0}^{[i/2]} \frac{n - i + 1}{n - i + k + 1} N_{2k} \sum_{h=0}^{i-2k} Q\bigl(j - (2n + 1)(h + k) + \tfrac{1}{2}i(i + 1);\ i - 2k, n, h\bigr)
 = \sum_{k=0}^{[i/2]} \frac{n - i + 1}{n - i + k + 1} N_{2k} \sum_{h=0}^{i-2k} Q\bigl(i(2n - i) - j - (2n + 1)(h + k) + \tfrac{1}{2}i(i + 1);\ i - 2k, n, h\bigr). \]

(b) \[ \sum_{k=0}^{[i/2]} \frac{n - i + 1}{n - i + k + 1} N_{2k} \sum_{h=0}^{i-2k} \Bigl\{ Q\bigl(j - (2n + 1)(h + k) + \tfrac{1}{2}i(i + 1);\ i - 2k, n, h\bigr)
 - Q\bigl(j - 1 - (2n + 1)(h + k) + \tfrac{1}{2}i(i + 1);\ i - 2k, n, h\bigr) \Bigr\} \ge 0, \]

for $0 < j \le [\tfrac{1}{2}i(2n - i)]$.

As in the case of $B(n)$, (a) holds with the $k$ summation omitted, as can
easily be proved using $Q(j; a, b, c) = Q(-j; a, b, a - c)$, so that the shells
themselves are layer symmetric, whereas (b) appears to be new.


5. D(n) 


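$D(n)$ is the Lie algebra of the group $O(2n)$, and has rank $n$. Its
positive roots are $n$-tuples of the form

\[ (0, \ldots, 0, 1, 0, \ldots, 0, \pm 1, 0, \ldots, 0), \]

with the nonzero entries in the $j$th and $k$th positions, and the simple roots are

\[ \alpha_i = (0, \ldots, 0, 1, -1, 0, \ldots, 0), \quad i \ne n, \]

with the entries $1, -1$ in the $i$th and $(i + 1)$th positions, and

\[ \alpha_n = (0, \ldots, 0, 1, 1). \]

The fundamental weights are easily found to be

\[ \Lambda^i = (\underbrace{1, \ldots, 1}_{i}, 0, \ldots, 0) \quad \text{for } i = 1, \ldots, n - 2, \]

\[ \Lambda^{n-1} = (\tfrac{1}{2}, \ldots, \tfrac{1}{2}, -\tfrac{1}{2}) \quad \text{and} \quad \Lambda^n = (\tfrac{1}{2}, \ldots, \tfrac{1}{2}). \]

$W(\Lambda^{n-1})$ and $W(\Lambda^n)$ are rather similar in structure to $W(\Lambda^n)$ of $B(n)$, both
having dimension $2^{n-1}$ and height $n(n - 1)/2$. $s_j(\Lambda^{n-1})$ and $s_j(\Lambda^n)$ are both
equal to $P(j; n - 1, n - 1)$, so application of Property 9 yields Theorem 2,
with $n$ replaced by $n - 1$. Nothing new is therefore obtained from their
analysis.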

For $i \le n - 2$, $W(\Lambda^i)$ has a similar structure to $W(\Lambda^i)$ of $C(n)$, with shells
$G_{2k}$, $k = 0, \ldots, [i/2]$, consisting of all $n$-tuples with $n - i + 2k$ zero
components. Freudenthal’s formula, with $\delta = (n - 1, n - 2, \ldots, 1, 0)$, implies that
weights of $G_{2k}$ have multiplicity $N_{2k}$, given in Eq. (32), so here $G_{2k}$ has total
dimension

\[ \binom{n - i + 2k}{k} \binom{n}{i - 2k} 2^{i-2k}. \]

Summing over all shells, we obtain the following expression for the total
dimension of $D(\Lambda^i)$:

\[ N(\Lambda^i) = \sum_{k=0}^{[i/2]} \binom{n - i + 2k}{k} \binom{n}{i - 2k} 2^{i-2k} = \binom{2n}{i}, \quad i = 1, \ldots, n - 2. \tag{39} \]
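The dimension formulas for $C(n)$ and $D(n)$ can be confirmed numerically. In the sketch below (our own function names), the sum over shells for $D(n)$ is compared with $\binom{2n}{i}$ as stated in Eq. (39). For $C(n)$ the text gives only the sum; the comparison value $\binom{2n}{i} - \binom{2n}{i-2}$ is the standard dimension of the $i$th fundamental representation of the symplectic algebra and is an outside fact, not taken from the paper.

```python
from fractions import Fraction
from math import comb


def dimension_D(n, i):
    """Shell sum for D(n): sum_k binom(n-i+2k, k) binom(n, i-2k) 2^(i-2k)."""
    return sum(comb(n - i + 2 * k, k) * comb(n, i - 2 * k) * 2 ** (i - 2 * k)
               for k in range(i // 2 + 1))


def dimension_C(n, i):
    """Shell sum for C(n): as for D(n), with each shell weighted by
    (n-i+1)/(n-i+k+1)."""
    total = Fraction(0)
    for k in range(i // 2 + 1):
        factor = Fraction(n - i + 1, n - i + k + 1)
        total += (factor * comb(n - i + 2 * k, k)
                  * comb(n, i - 2 * k) * 2 ** (i - 2 * k))
    return total


def symplectic_dimension(n, i):
    """Standard value binom(2n, i) - binom(2n, i-2) (assumed, see lead-in)."""
    return comb(2 * n, i) - (comb(2 * n, i - 2) if i >= 2 else 0)


print(dimension_D(5, 3), comb(10, 3))
print(dimension_C(3, 2), symplectic_dimension(3, 2))
```

For example, the $D(5)$, $i = 3$ shell sum is $\binom{10}{3} = 120$, and the $C(3)$, $i = 2$ shell sum is 14.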
An analogous calculation to those used for $B(n)$ and $C(n)$ leads to

\[ T(\Lambda^i) = i(2n - i - 1) \tag{40} \]

and an arbitrary weight of $G_{2k}$ of type $M_{2k}^{m_1, \ldots, m_h}(l_1, \ldots, l_{i-2k})$, $k = 0, \ldots, [i/2]$, $h = 0,
\ldots, i - 2k$, is found to be on the layer $\omega_J$, where here

\[ J = \sum_{\lambda=1}^{i-2k} \varepsilon_\lambda l_\lambda + 2n(k + h) - \tfrac{1}{2}i(i + 1) \]

so

\[ s_j(\Lambda^i) = \sum_{k=0}^{[i/2]} N_{2k} \sum_{h=0}^{i-2k} Q\bigl(j - 2n(h + k) + \tfrac{1}{2}i(i + 1);\ i - 2k, n, h\bigr). \tag{41} \]

Property 9 implies the following: 

Theorem 5 For $i = 1, \ldots, n - 2$,

(a) \[ \sum_{k=0}^{[i/2]} N_{2k} \sum_{h=0}^{i-2k} Q\bigl(j - 2n(h + k) + \tfrac{1}{2}i(i + 1);\ i - 2k, n, h\bigr)
 = \sum_{k=0}^{[i/2]} N_{2k} \sum_{h=0}^{i-2k} Q\bigl(i(2n - i - 1) - j - 2n(h + k) + \tfrac{1}{2}i(i + 1);\ i - 2k, n, h\bigr). \]

(b) \[ \sum_{k=0}^{[i/2]} N_{2k} \sum_{h=0}^{i-2k} \Bigl\{ Q\bigl(j - 2n(h + k) + \tfrac{1}{2}i(i + 1);\ i - 2k, n, h\bigr)
 - Q\bigl(j - 1 - 2n(h + k) + \tfrac{1}{2}i(i + 1);\ i - 2k, n, h\bigr) \Bigr\} \ge 0, \]

for $0 < j \le \tfrac{1}{2}i(2n - i - 1)$.

Again, using $Q(j; a, b, c) = Q(-j; a, b, a - c)$, (a) could be proved very
easily even with the $\sum_{k=0}^{[i/2]} N_{2k}$ omitted, showing the shells themselves to be
layer symmetric, whereas (b) appears to be new. It should be stressed that in
(b) of Theorems 3-5, neither the $k$ nor the $h$ summations may be omitted
without invalidating the formulas, as simple counterexamples will soon
verify.

To summarize, using one theorem concerning representations of the
simple Lie algebras, five theorems concerning partitions have been obtained,
part (a) of each theorem being trivially derivable using the usual methods,
but part (b) being nontrivial. On the other hand, using elementary properties
of partitions we have seen that the symmetric part of Dynkin’s theorem
actually holds not just for the representations but also, in the case of the
fundamental representations of A(n), B(n), C(n), and D(n), for the shells of
the representations.

The question as to whether new properties of partitions might ensue by
applying Dynkin’s theorems to arbitrary irreducible, or even reducible,
representations of the simple Lie algebras, including in this case also the
exceptional Lie algebras, might be worthy of further investigation.

I should like to thank Professor G. E. Andrews of the Pennsylvania State
University for encouraging me to write this paper, and for several extremely
helpful communications.
1.


For $s = \sigma + it$, $L(s)$ stands throughout this paper for $L(s, \chi_1, 4)$, given for
$\sigma > 1$ by

\[ L(s) = \sum_{n=1}^{\infty} \frac{\chi_1(n)}{n^s} = \sum_{n=0}^{\infty} \frac{(-1)^n}{(2n + 1)^s}. \tag{1.1} \]
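The two forms of the series in (1.1) are easy to compare numerically. The sketch below is only an illustration with names of our own choosing: `chi1` is the non-principal character mod 4, and the partial sums are evaluated at $s = 2$, where the value of the series is Catalan's constant (a standard fact, not stated in the paper).

```python
def chi1(n):
    """Non-principal character mod 4: 0 for even n, +1 if n = 1 (mod 4),
    -1 if n = 3 (mod 4)."""
    if n % 2 == 0:
        return 0
    return 1 if n % 4 == 1 else -1


def L_character_sum(s, upto):
    """Partial sum of sum_{n>=1} chi1(n) / n^s over n <= upto."""
    return sum(chi1(n) / n ** s for n in range(1, upto + 1))


def L_alternating_sum(s, terms):
    """Partial sum of sum_{n>=0} (-1)^n / (2n+1)^s with the given number of terms."""
    return sum((-1) ** n / (2 * n + 1) ** s for n in range(terms))


# Both forms agree term by term: n <= 2N corresponds to N terms of the second series.
print(L_character_sum(2, 2000), L_alternating_sum(2, 1000))
```

Since the series is alternating with decreasing terms for real $s > 0$, the error of a partial sum is at most the first omitted term.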


In our paper [1] we showed that if the Riemann-Piltz conjecture holds for
$L(s)$, i.e.,

\[ L(s) \ne 0 \quad \text{for } \sigma > \tfrac{1}{2}, \tag{1.2} \]

then for suitable explicitly calculable positive constants $A_1$, $A_2$, and $A_3$ the
inequality

\[ \sum_{p > 2} (-1)^{(p-1)/2} \log p \, \exp\Bigl(-A_1 \log^2 \frac{p}{x}\Bigr) < -A_2 \sqrt{x} \tag{1.3} \]

holds for all $x > A_3$. This fact gives great interest to the problem of finding
explicitly long sequences of consecutive primes $p_\nu, p_{\nu+1}, p_{\nu+2}, \ldots, p_{\nu+k}$ with

\[ p_\nu \equiv p_{\nu+1} \equiv \cdots \equiv p_{\nu+k} \equiv 1 \bmod 4. \tag{1.4} \]

The longest such sequence below $10^6$ was found by Den Haan in Eindhoven
consisting of 11 terms; it would be very desirable to find longer such
sequences, below $10^8$, say.

† Deceased.
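A search of the kind described here is straightforward to set up. The following sketch (function names are ours) sieves the primes below a bound and reports the length and the starting prime of the longest run of consecutive primes in a given residue class mod 4; it is a plain illustration, not the method used for the computation cited in the text.

```python
def primes_below(limit):
    """List of primes p < limit by the sieve of Eratosthenes (limit >= 2)."""
    sieve = bytearray([1]) * limit
    sieve[0:2] = b"\x00\x00"
    for p in range(2, int(limit ** 0.5) + 1):
        if sieve[p]:
            sieve[p * p::p] = bytearray(len(range(p * p, limit, p)))
    return [p for p in range(limit) if sieve[p]]


def longest_run(limit, residue=1):
    """Length and first prime of the longest run of consecutive primes
    below `limit` that are all congruent to `residue` mod 4."""
    best_len, best_start = 0, None
    cur_len, cur_start = 0, None
    for p in primes_below(limit):
        if p % 4 == residue:
            if cur_len == 0:
                cur_start = p
            cur_len += 1
            if cur_len > best_len:
                best_len, best_start = cur_len, cur_start
        else:
            cur_len = 0
    return best_len, best_start


print(longest_run(100))
print(longest_run(10 ** 6))
```

Below 100 the longest run has length 2, first attained at 13, 17. The second call repeats the search below $10^6$, the range discussed above.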

2.


This problem suggests a number of exciting questions if we add the
qualification “infinitely often.” The simplest such problem is obviously that
concerning

\[ f_2(x) \overset{\mathrm{def}}{=} \sum_{\substack{p_\nu \le x \\ p_{\nu-1} \equiv p_\nu \equiv 1 \,(4)}} 1 \tag{2.1} \]

if $x \to \infty$. When asked about the possibility of finding an elementary proof of
the relation

\[ \lim_{x \to \infty} f_2(x) = +\infty \tag{2.2} \]

some years ago, Erdős was definitely very pessimistic about it. As remarked
by Ingham on the last page of his book [2], Littlewood’s method for proving



gives, with the usual notations, mutatis mutandis

\[ \varlimsup_{x \to \infty} \{\pi(x, 4, 1) - \pi(x, 4, 3)\} = +\infty \]

\[ \varliminf_{x \to \infty} \{\pi(x, 4, 1) - \pi(x, 4, 3)\} = -\infty, \]

which clearly implies (2.2) (and nothing more). Let $c$ stand throughout this
paper for positive explicitly calculable numerical constants, not necessarily
the same in different occurrences. Then we are going to prove the following

Theorem For $T > c$, the inequality

\[ f_2(T) > \log^B T \]

holds with positive explicitly calculable constant $B$.
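To get a feeling for the size of $f_2$, one can tabulate it directly from definition (2.1). The sketch below uses names of our own and reads the condition as $p_\nu \le x$; it counts the indices $\nu$ for which both $p_{\nu-1}$ and $p_\nu$ are congruent to 1 mod 4.

```python
def primes_up_to(x):
    """List of primes p <= x by the sieve of Eratosthenes."""
    if x < 2:
        return []
    sieve = bytearray([1]) * (x + 1)
    sieve[0:2] = b"\x00\x00"
    for p in range(2, int(x ** 0.5) + 1):
        if sieve[p]:
            sieve[p * p::p] = bytearray(len(range(p * p, x + 1, p)))
    return [p for p in range(x + 1) if sieve[p]]


def f2(x):
    """Number of primes p_nu <= x with p_{nu-1} = p_nu = 1 (mod 4),
    where p_{nu-1} is the prime immediately preceding p_nu."""
    primes = primes_up_to(x)
    return sum(1 for prev, cur in zip(primes, primes[1:])
               if prev % 4 == 1 and cur % 4 == 1)


for x in (100, 1000, 10 ** 4, 10 ** 5):
    print(x, f2(x))
```

Up to 100 the qualifying pairs are (13, 17), (37, 41), and (89, 97), so $f_2(100) = 3$.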

For $f_3(x)$ defined by

\[ f_3(x) = \sum_{\substack{p_\nu \le x \\ p_{\nu-2} \equiv p_{\nu-1} \equiv p_\nu \equiv 1 \,(4)}} 1 \tag{2.3} \]

no proofs are known even for

\[ \lim_{x \to \infty} f_3(x) = \infty. \]

It seems to be hopelessly difficult at present to prove for arbitrarily large
prescribed $\omega$ the existence of one sequence $p_\nu, p_{\nu+1}, \ldots, p_{\nu+\omega}$ with

\[ p_\nu \equiv p_{\nu+1} \equiv \cdots \equiv p_{\nu+\omega} \equiv 1 \,(4). \tag{2.4} \]

This first lower bound for $f_2(T)$ in our theorem is seemingly not very
strong, though the (small) exponent $B$ could be considerably improved by
a more careful treatment. Much more interesting seems to be our guess that
the events

\[ p_\nu \equiv p_{\nu+1} \equiv 1 \,(4), \qquad p_\nu \equiv p_{\nu+1} \equiv 3 \,(4), \]

\[ p_\nu \equiv 1 \,(4),\ p_{\nu+1} \equiv 3 \,(4), \qquad p_\nu \equiv 3 \,(4),\ p_{\nu+1} \equiv 1 \,(4) \]

are “not equally probable.” It would be of definite interest, and perhaps
feasible in the not too distant future, to prove

="(tab)- (2,5) 

We shall mention a further problem of localization later. 

3.


Now we turn to the proof of our theorem. In our paper [3] we proved the
following (see Knapowski and Turán [3, p. 201]). Suppose that in the strip
$|t| \le \tfrac{1}{2} \log^{1/10} T$ the function $L(s)$ has at least one zero in the
parallelogram†

\[ \sigma \ge \tfrac{1}{2} + \log^{-1/20} T, \qquad |t| \le \tfrac{1}{2} \log^{1/10} T. \tag{3.1} \]

Then we have for $T > c$, numbers $U_1$, $U_2$ with

\[ (2 <)\ T \exp(-2 \log^{15/16} T) \le U_1 < U_2 \le T, \tag{3.2} \]

so that the inequality

\[ \sum_{U_1 \le p \le U_2} (-1)^{(p-1)/2} \log p > \sqrt{U_2} \tag{3.3} \]

holds. Let us call a $p_\nu \equiv 1 \,(4)$ a “bad” one if $p_{\nu+1} \equiv 3 \,(4)$ and a “good” one if
$p_{\nu+1} \equiv 1 \,(4)$. Associating each bad $p_\nu$ with its $p_{\nu+1}$, the contribution of
$(\log p_\nu - \log p_{\nu+1})$ being negative can be dropped from the left-hand side of
(3.3). The same holds for the contribution of the $p_\nu \equiv 3 \,(4)$ which were not
associated with a $p_{\nu-1} \equiv 1 \,(4)$. Hence from (3.3) it follows in the case of (3.1)

\[ \{f_2(U_2) - f_2(U_1)\} \log U_2 > \sqrt{U_2} \]

and a fortiori

\[ f_2(T) > \frac{\sqrt{U_2}}{\log U_2} > \sqrt{T} \exp(-3 \log^{15/16} T) \tag{3.4} \]

for $T > c$. This is much stronger than the inequality of the theorem.

† In (9.1) of Knapowski and Turán [3] the inequality $|t| \le T$ reads correctly
$|t| \le \tfrac{1}{2} \log^{1/10} T$.

4.


In this first case (3.1) we got also the “local” theorem that the interval

\[ [T \exp(-2 \log^{15/16} T),\ T] \tag{4.1} \]

contains a pair $(p_\nu, p_{\nu+1})$ with $p_\nu \equiv p_{\nu+1} \equiv 1 \,(4)$. This was noticed already in
Knapowski and Turán [3]. We have now to investigate case II which,
owing to the functional equation, means that the roots of $L(s)$ in the strip
$|t| \le \tfrac{1}{2} \log^{1/10} T$ are contained in the narrow parallelogram

\[ |\sigma - \tfrac{1}{2}| \le \log^{-1/20} T, \qquad |t| \le \tfrac{1}{2} \log^{1/10} T. \tag{4.2} \]

For this case we stated in Knapowski and Turán [3] that for $T > c$ and
suitable

\[ \log_3 T \le U_2 \exp(-\log^{15/16} U_2) \le U_1 < U_2 \le T \tag{4.3} \]

the inequality

\[ \sum_{U_1 \le p \le U_2} (-1)^{(p-1)/2} \log p > \sqrt{U_2} \tag{4.4} \]

holds. The proof of (4.3)-(4.4) in case II was, on p. 201 of Knapowski and
Turán [3], postponed to the English edition of the book of the second author
since it is rather long and uses ideas of Littlewood, Ingham,
and Skewes. This will indeed be so (even with $\log_2 T$ instead of $\log_3 T$ in
(4.3)), but for our present theorem a much shorter argument can be given
(based again essentially on ideas of Littlewood, Ingham, and Skewes).


5. 


def 


n<x, n odd 


2 <p<x 


)/2 A(n) = A(x) 

(5.1) 

def 

log p = A t (x). 

(5.2) 


Let 
Obviously,

Aj(x) = A(x) - £ log P + °(x 113 l°g x ) = A (x) - (1 + o(l)) v /x. 

P<y/X 

Hence if for a ^ > c we have 

(5-3) 

then also Aj(£) > holds and the reasoning of Section 3 gives 

/a(0 >>/*/(log ^). (5-4) 

Hence we have to produce in case II, i.e., under supposition (4.2) a possibly 
large £ < T satisfying (5.3). 


6 . 


In order to do this we write 


def 

A {e*)e~ m = G(9) 


(6.1) 


and we start with the “exact” formula of Riemann-Mangoldt in the finitised 
form as one can find, e.g., in Prachar’s book “Primzahlverteilung,” p. 228. 
Restricting us to 


log log log T < 9 <yo log log T (6.2) 

(to which we are forced by (4.2)) and denoting the nontrivial zeros of L(s) by 

P = P + iy (6.3) 


we get from it easily 

(p-( l/2))» 

G(9)=- X —+ o(4 (6-4) 

|y|<(1/2) logi/io T P 


Now we can use (4.2). We get 


£(p-(l/2)}3 

e iy9 

< 

! 

1 

: 

T 

a 

+ 

1 

1 

P 

iy 


P 

P 

iy 



owing to (6.2) and (6.4), this gives under (4.2)-(6.2) 


<?(») + 


I 

|y|<(l/2)lo g 2 


e 


iy& 


t iy 


< C. 


(6.5) 
If A is a big constant to be determined later, we form after Ingham the
integral

where co is restricted by 

log log log T < (D - i < a> + | log log T. (6.7) 

Using (6.6)-(6.5) and completing the integrals to (— oo, +oo) we get, using 
also the formula 


1 r 00 /sin 
2* i-co \y 




1 — |A| for -1 <2 < +1 
0 for |1| > 1 


of Fejer, the inequality 

jv«> 

»<■».'I)+ I -TT 

\?\<a iy 
< cl 1 + 


(i+ I i) r exp W/AMiy 

\ |y|<(l/2) logi /10 T \y I / J ^4/4 \ >7^ / 


( 6 . 8 ) 


Since the contribution of the terms with | y \ < A to the last sum is 

c ^ 1 log 2 A 

<-. I t r < c-— <c 


|y|<^l 


Ivl 


and those of the terms with A< 171 <? log 1/10 T after partial integration 


< I , y 

A< jy|<(l/2) logi/10 T 1/ 


LM J_ JL\ 
mi?M ! I v I / ‘ c ’ 


we get from (6.8) 


H(a>, A) + 2 £ 


sin ya> 


o < y <a y 


< Co 


(6.9) 


where c 0 is an explicitly calculable positive numerical constant (i.e., does not 
depend on A either), whose value matters somewhat. 


7 . 


We shall need the elegant lemma of Ingham which asserts with 

log 2 A 

a 0 =- -7- 


(7.1) 
the inequality


2 I 

0<y</* 


sin ya 0 


-2 


1 


y ~ I «o I 

(cj is the same sort of constants as c 0 was in (6.9)). 


+ c i 


(7.2) 


8 . 

Now let A be the smallest constant satisfying the inequalities 

log 2 yt 

— 4 — ^ i 2 lo g bg2 A > T log A, log A > 3, (8.1) 


further with the constants c 0 and c x 



Co log A, c t < -fa log A, 



lc 

(8.2) 

and finally 

N(A) > 1 

(8.3) 

where 

II 

M 



0 < y <x 


If A* is the value of this A, (6.9)-(8.2) give 

H(o), A*)> -2 £ —^ log A*. (8.4) 

0<y<A* 7 


9 . 

Next we choose co (taking care of (6.7)). Putting 

q = [ 8 OO 71 log A*] (9.1) 

Dirichlet’s theorem gives the existence of an integer m with 

_i_ q -N(A*) log log j < w < _i_ i og i og j (9.2) 

so that for all y’s in (8.4), the inequality 
holds.† With this $m$ we choose 


$$\omega \overset{\mathrm{def}}{=} m + \alpha_0^{*} = \omega^{*}. \tag{9.4}$$

(9.2)-(8.1) assures that (6.7) is not violated for $T > c$. Then we have for 
all of our $\gamma$'s with $\alpha_0^{*} = -(\log^2 A^{*})/A^{*}$ 


| sin yag — sin yco* | = 


sin ya£ 


sin yotf 


— sin | 

- sin | 


ya§ + 2 n 


my 
2 n j 


and hence using (9.5)-(8.1)-(7.2)-(8.2) 


* . Q 2n \ 
yotf + 9 — i <—, 

q q 


1 


(9.5) 


_2 £ sin yco* > ^ ^ sinyocj _ c ^ 

0<y<,4* y 0<y<A* 1 q 0<y<A* \ 7 \ 


>-2 I 


sin ya$ 


0 < y < >4* 7 


- TUo log A* 


> i log A* - Cj - rm log A* > (£ - t&j) log A*. 
Then (6.9) gives from (8.2) 

H(o*, A*) > (i - jbs) log A* - £ log A* == (^ - xoo) log A*. (9.6) 
Going back to G(x) this gives, owing to (8.1)-(8.2), 

1 c A * 14 

2 n 




Hence there is an 


so that 


CO 


* — 1 < co** < co* -+■ \ 


(9.7) 


i.e., 

Putting 

we obtain 


A(e®**) exp( — \co**) — G(a>**) > 3, 
A(e w **) > 3 exp(-jo>**). 
exp(co**) = £ 

Mi) > i\fi 


(9.8) 


† $\{x\}$ denotes, as usual, the distance of $x$ from the nearest integer. 
and from (5.4) 

/*«;)> 3 Jl. (9.9) 

Since from (9.7)-(9.4)-(9.2) for T > c 

£ < exp(co* + i) < 2 \og 1/20n T < T 

and 

( > exp(o>* -})> log 0 T, 
these and (9.9) give in case II 

f2(T)>f 2 (0>^> l ° gB T 

with a positive constant B indeed. This and (3.4) complete the proof of our 
theorem. 
Introduction 


Let $A$ be a commutative ring equipped with an involution $J_A$. For
conciseness, we write $A$ instead of the pair $(A, J_A)$, and denote the ring $A$
without involution by $|A|$. A signature $\sigma$ on $A$ is defined as a
homomorphism from the Witt ring $W(A)$ of nondegenerate hermitian forms over $A$ to
the ring of integers $\mathbb{Z}$. If $|A|$ has a connected spectrum, either $A$ has no
signatures at all or the kernels of the signatures are precisely all minimal
prime ideals of $W(A)$, as has been shown in Knebusch [7, I, §2] and Dress [3].
It also can be shown that $A$ has no signatures if and only if $-1$ is a sum of
norms $x J_A(x)$ in $A$ (proof to be published, cf. [8] for $|A|$ semilocal). In the
case that $A$ is a field and $J_A$ is trivial it is well known that the
signatures of $A$ correspond uniquely to the orderings of $A$ [5, 11]. For $|A|$ a
semilocal ring, a general study of signatures and related topics can be found
in Knebusch et al. [10] and [8]. 

For a homomorphism $\varphi\colon A \to B$ into another ring with involution $B$ an
extension $\tau$ of a signature $\sigma$ on $A$ to $B$ (with respect to $\varphi$) is
defined as a signature $\tau$ on $B$ such that the diagram

$$
\begin{array}{ccc}
W(A) & \xrightarrow{\ \varphi_*\ } & W(B) \\
{\scriptstyle \sigma}\searrow & & \swarrow{\scriptstyle \tau} \\
 & \mathbb{Z} &
\end{array}
$$

with $\varphi_*$ induced by $\varphi$ is commutative. Recently I developed a theory of
"real closures" of a pair $(A, \sigma)$ with $A$ an arbitrary commutative ring with
involution and $\sigma$ a signature on $A$ (cf. [7]), which generalizes Artin-Schreier's
well-known theory of real closures of ordered fields. This theory is tied up with a
theory of extensions of $\sigma$ for $\varphi$ finite etale (i.e.,
$|\varphi|\colon |A| \to |B|$ finite etale). As has been shown in [7] there exist only
finitely many extensions of $\sigma$ to $B$ in this case. Moreover the regular trace
$\mathrm{Tr}_{B/A}$ from $B$ to $A$ yields a $W(A)$-linear map

$$\mathrm{Tr}^*_{B/A}\colon W(B) \to W(A),$$
(cf. [12, 7]) and for $z$ in $W(B)$ the following trace formula holds true [7, I, §3]:

$$\sigma\bigl(\mathrm{Tr}^*_{B/A}(z)\bigr) = \sum_{\tau \mid \sigma} n(\tau)\,\tau(z).$$

In this sum $\tau$ runs through the extensions of $\sigma$ to $B$, and the
"multiplicities" $n(\tau)$ are positive integral numbers, uniquely determined by
$\tau$ and $\varphi$. If $|A|$ is semilocal and $J_A$ is trivial, all these
multiplicities are 1 [7, II, §8].

Inserting $z = 1$ in this trace formula we see that the number $r$ of signatures
of $B$ extending $\sigma$ is at most equal to the rank $[B : A]$ of the projective
$A$-module $B$. If $|A|$ is semilocal and $J_A = \mathrm{id}$, moreover
$r \equiv [B : A] \bmod 2$. 

In the present paper we study the extensions of $\sigma$ to $B$ in the case that $B$ is
a "Frobenius extension" of $A$. For any element $a$ of a ring with involution we
denote the image of $a$ under this involution by $\bar{a}$. We call $B$ a Frobenius
extension of $A$ if $B$ is, as an $A$-module, finitely generated and projective and if
there exists a linear form $s\colon B \to A$ on this module such that
$s(\bar{b}) = \overline{s(b)}$ for $b$ in $B$ and the hermitian form $s(\bar{x}y)$ on
the $A$-module $B$ is nondegenerate. If the involutions are trivial, this is indeed
the usual notion of Frobenius extension occurring in the literature (cf. Eilenberg
and Nakayama [4]). We also say that $s$ is a Frobenius form on $B$. 

If $B$ is finite etale, the trace $\mathrm{Tr}_{B/A}$ is a Frobenius form on $B$. But
the class of Frobenius extensions is much larger than the class of finite etale
extensions. This is the reason the present paper seems to be a necessary step in the
theory of hermitian forms over rings. For example, let $a$ be an element in $A$
with $\bar{a} = a$. The extension $B := A[X]/(X^2 - a) = A[x]$ with $J_B$ extending
$J_A$ and $\bar{x} = x$ is Frobenius. But if 2 or $a$ is not a unit in $A$, then $B$
is not etale. Certainly such extensions are important for a study of hermitian
forms. A useful example of a cubic Frobenius extension occurs in [9]. 

We essentially apply the same methods as in [7], cf. in particular §§3
and 4 of [7], and we also use the terminology and the notations of [7]. Our
results are satisfactory only in the case that $|A|$ is semilocal. Much work
remains to be done. For example, we do not know in general whether a
given signature $\sigma$ of $A$ has only finitely many extensions to a given
Frobenius extension $B$ of $A$. As has been shown in [7, §3] this is true if $B$ is
finite etale over $A$. 


1. The Transfer Formula 

In this section $A$ is an arbitrary commutative ring with involution. Let
$\varphi\colon A \to B$ be a Frobenius extension of $A$ and $s\colon B \to A$ a
Frobenius form. Then $s$ induces an additive map

$$s^*\colon W(B) \to W(A),$$

which is defined as follows. Let $(E, \Phi)$ be a hermitian space over $B$, i.e., a
finitely generated projective $B$-module $E$ equipped with a nondegenerate
hermitian form $\Phi$. ($\Phi$ is assumed to be antilinear in the first and linear in
the second variable.) Then $s^*$ maps the Witt class $[E, \Phi]$ of this hermitian
space to the Witt class $[E, s \circ \Phi]$ of the $A$-module $E$ equipped with the
hermitian form $s \circ \Phi$ (cf. Scharlau [12]). Clearly $s^*$ is $W(A)$-linear,
i.e., we have

$$s^*(\varphi_*(x)\,y) = x \cdot s^*(y) \tag{1.1}$$

for $x$ in $W(A)$ and $y$ in $W(B)$. 

Now let $\sigma$ be a signature on $A$. We denote by $S(\varphi, \sigma)$ the set of all
signatures $\tau$ of $B$ that extend $\sigma$.

Theorem 1.1 There exists a unique family $(m(\tau) \mid \tau \in S(\varphi, \sigma))$ of
integral numbers such that almost all $m(\tau) = 0$ and for every $z$ in $W(B)$ the
following equation holds true:

$$\sigma(s^*(z)) = \sum_{\tau \mid \sigma} m(\tau)\,\tau(z).$$

Here $\tau$ runs through the set $S(\varphi, \sigma)$ of all extensions of $\sigma$ to $B$.
(If this set is empty, the equation reads $\sigma(s^*(z)) = 0$.)

Proof The kernel $P(\tau)$ of a signature $\tau\colon W(B) \to \mathbb{Z}$ is a minimal
prime ideal of $W(B)$ [3, 7]. Thus there do not exist any inclusion relations between
the kernels of different signatures of $B$. This already implies that there exists
at most one family $(m(\tau) \mid \tau \in S(\varphi, \sigma))$ for given $\varphi$,
$\sigma$, $s$ with the above properties; cf. Knebusch [7, I, p. 72]. 
To prove the existence of such a family $(m(\tau))$ we choose a homomorphism
$\alpha$ from $A$ into a real closed field with involution $R$ inducing $\sigma$. In
more explicit terms this means the following. $|R|$ is an algebraically closed field.
$J_R$ is nontrivial, and thus the fixed field $R_0$ of $J_R$ is a real closed field in
the sense of Artin-Schreier. We have

$$W(R) \cong W(R_0) \cong \mathbb{Z}$$

($R_0$ equipped with the trivial involution), and denoting the unique signature
of $R$ by $\rho$ we have $\rho \circ \alpha_* = \sigma$. Such a homomorphism $\alpha$
exists according to [7, I, §4]. 

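The tensor product $B \otimes_A R$ with respect to $\varphi$ and $\alpha$, equipped with
the involution $J_B \otimes J_R$, is a Frobenius extension of $R$ with the Frobenius
form $s \otimes 1$ from $B \otimes_A R$ to $R$. It is easily checked that the following
diagram commutes:

$$
\begin{array}{ccc}
W(B) & \xrightarrow{\ (1 \otimes \alpha)_*\ } & W(B \otimes_A R) \\
{\scriptstyle s^*}\downarrow & & \downarrow{\scriptstyle (s \otimes 1)^*} \\
W(A) & \xrightarrow{\ \alpha_*\ } & W(R)
\end{array}
$$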

We have a unique direct decomposition

$$B \otimes_A R = \prod_{i=1}^{g} B_i$$

into connected rings with involution $B_i$. Let $\varphi_i\colon R \to B_i$ denote the
components of the homomorphism $\varphi \otimes 1$ from $R$ to $B \otimes_A R$, and
$\alpha_i\colon B \to B_i$ the components of the homomorphism $1 \otimes \alpha$ from $B$
to $B \otimes_A R$. We may assume that for some $r$, $0 \le r \le g$, all rings $|B_i|$
with $1 \le i \le r$ are connected and all $|B_i|$ with $r < i \le g$ are not connected.
Then for $i > r$, the ring $B_i$ with involution is isomorphic to a product
$D_i \times D_i$ of two copies of a ring $D_i$ with the "switch" of the factors as
involution. Thus $W(B_i) = 0$ for $i > r$. For $i \le r$, we denote by $N_i$ the nil
radical of $B_i$ and we have

$$B_i = \varphi_i(R) \oplus N_i.$$

Since 2 is a unit in $R$ and $N_i$ is nilpotent, $\varphi_i$ induces an isomorphism
$\varphi_{i*}$ from $W(R) \cong \mathbb{Z}$ onto $W(B_i)$ (cf., e.g., [14, Theorem 2.2.1]). 

Let $s_i\colon B_i \to R$ denote the restriction of the $R$-linear form $s \otimes 1$ to
the direct summand $B_i$ of the $R$-module $B \otimes_A R$. Clearly, $s_i$ is a
Frobenius form with respect to $\varphi_i\colon R \to B_i$. Denoting finally the
projection from $B \otimes_A R$ to $B_i$ by $p_i$, the induced maps $p_{i*}$ from
$W(B \otimes_A R)$ to $W(B_i)$ yield an isomorphism

$$(p_{1*}, \dots, p_{r*})\colon W(B \otimes_A R) \xrightarrow{\ \sim\ } \prod_{i=1}^{r} W(B_i)$$

and for $u$ in $W(B \otimes_A R)$ we have

$$(s \otimes 1)^*(u) = \sum_{i=1}^{r} s_i^*\, p_{i*}(u).$$

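Now it is not difficult to prove the desired formula for $\sigma s^*(z)$, with $z$ in
$W(B)$. We have

$$\sigma s^*(z) = \rho\,\alpha_* s^*(z) = \rho\,(s \otimes 1)^*(1 \otimes \alpha)_*(z)
= \sum_{i=1}^{r} \rho\, s_i^*\, p_{i*}(1 \otimes \alpha)_*(z)
= \sum_{i=1}^{r} \rho\, s_i^*\, \alpha_{i*}(z).$$

Let $\gamma_i$ denote the unique homomorphism from $B_i$ to $R$ with
$\gamma_i \circ \varphi_i = \mathrm{id}$ ($1 \le i \le r$). We have
$\gamma_{i*} \circ \varphi_{i*} = \mathrm{id}$. Since $\varphi_{i*}$ is an isomorphism
this implies $\varphi_{i*} \circ \gamma_{i*} = \mathrm{id}$. We obtain

$$\sigma s^*(z) = \sum_{i=1}^{r} \rho\, s_i^*\, \varphi_{i*}\gamma_{i*}\alpha_{i*}(z)
= \sum_{i=1}^{r} \rho\, s_i^*\, \varphi_{i*}\beta_{i*}(z)$$

with $\beta_i$ denoting the homomorphism $\gamma_i \circ \alpha_i$ from $B$ to $R$.
Applying (1.1) this can be simplified to

$$\sigma s^*(z) = \sum_{i=1}^{r} \rho\, s_i^*(1) \cdot \rho\,\beta_{i*}(z).$$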

Notice that the $\beta_i$ are precisely all homomorphisms $\beta$ from the ring with
involution $B$ to $R$ with $\beta \circ \varphi = \alpha$. For any signature $\tau$ in
$S(\varphi, \sigma)$, we denote by $I(\tau)$ the set of all indices $i$,
$1 \le i \le r$, such that $\rho \circ \beta_{i*} = \tau$. Of course only
finitely many $I(\tau)$ are nonempty. Further, we denote by $B(\tau)$ the product of
all $B_i$ with $i$ in $I(\tau)$. We have

$$B \otimes_A R = \prod_{\tau \mid \sigma} B(\tau).$$

Finally, we denote by $s_\tau$ the restriction of $s \otimes 1$ to the direct summand
$B(\tau)$ of the $R$-module $B \otimes_A R$. Clearly, $s_\tau$ is a Frobenius form on
the extension $B(\tau)$ of $R$. Using these notations we can write

$$\sigma s^*(z) = \sum_{\tau \mid \sigma} m(\tau)\,\tau(z)$$

with

$$m(\tau) := \rho\, s_\tau^*(1). \tag{1.2}$$
In more explicit terms, $m(\tau)$ is the signature of the hermitian form
$s_\tau(\bar{x}y)$ on the vector space $B(\tau)$ over $R$.

This proves our Theorem 1.1 and also gives some insight into the nature
of the coefficients $m(\tau)$.

The coefficients $m(\tau)$ depend only on $\varphi$ and $\tau$. We denote them hereafter
by $n(\tau, s)$. The notation $n(\tau, \varphi, s)$ or $n(\tau, A, s)$ will be
appropriate whenever it is not clear from the context which base ring $A$ is under
consideration. The formula

$$\sigma s^*(z) = \sum_{\tau \mid \sigma} n(\tau, s)\,\tau(z)$$

will be called the transfer formula for $\varphi\colon A \to B$ and $s$. 

From (1.2) we deduce, in the case that the rank $[B : A]$ of the projective
$A$-module $B$ is constant, the modest information

$$\sum_{\tau \mid \sigma} |n(\tau, s)| \le [B : A]. \tag{1.3}$$

Furthermore,

$$n(\tau, s) \equiv [B(\tau) : R] \bmod 2.$$

Thus if $[B : A]$ is constant, we see that

$$\sum_{\tau \mid \sigma} n(\tau, s) \equiv [B : A] \bmod 2. \tag{1.4}$$

From our proof of the transfer formula also the following corollary is 
evident. 


Corollary 1.2 Let $s$ be a Frobenius form on $B$ with respect to $\varphi\colon A \to B$.
Let $\sigma$ be a signature on $A$ and $\tau$ be an extension of $\sigma$ to $B$ with
$n(\tau, s) \ne 0$. Then for any homomorphism $\alpha\colon A \to R$ into a real closed
field with involution $R$ that induces $\sigma$ there exists a homomorphism
$\beta\colon B \to R$ that induces $\tau$ and extends $\alpha$, i.e.,
$\beta \circ \varphi = \alpha$.

In general $n(\tau, s)$ may well be zero (cf. Section 2). We mention two cases
in which $n(\tau, s) \ne 0$ for all extensions $\tau$ of $\sigma$ to $B$. 

Remark 1.3 If $\varphi$ is finite etale and $s$ is the regular trace
$\mathrm{Tr}_{B/A}$, then all $n(\tau, s)$ are positive numbers. Moreover, given a
homomorphism $\alpha$ from $A$ into a real closed field with involution $R$ inducing
$\sigma$, the number $n(\tau, s)$ coincides with the cardinality $n(\tau, \alpha)$ of
the set of homomorphisms $\beta$ from $B$ to $R$ that induce $\tau$ and extend $\alpha$
(cf. Knebusch [7, I, §3]). Indeed, in the proof of the transfer formula now the ideals
$N_i$, $1 \le i \le r$, are zero, and the Frobenius forms $s_i\colon B_i \to R$ are
the inverse maps of the isomorphisms $\varphi_i\colon R \to B_i$.

This implies immediately $n(\tau, s) = n(\tau, \alpha)$. 
If $E$ is a hermitian space† over $A$ of constant rank $n$ then $|\sigma(E)| \le n$ for
every signature $\sigma$ on $A$. Indeed, let $\alpha\colon A \to R$ be a homomorphism
into a real closed field with involution $R$ inducing $\sigma$. Then $\sigma(E)$ is the
usual Sylvester signature of the hermitian space $E \otimes_A R$ over $R$. We call $E$
positive definite at $\sigma$ if $\sigma(E) = n$. Returning to our extension
$\varphi\colon A \to B$ with Frobenius form $s\colon B \to A$ we denote by
$s^*\langle 1 \rangle$ the hermitian space $B$ over $A$, equipped with
the hermitian form $s(\bar{x}y)$. 

Proposition 1.4 If $B$ has constant rank over $A$ and $s^*\langle 1 \rangle$ is positive
definite at $\sigma$, then for every signature $\tau$ on $B$ extending $\sigma$ the
coefficient $n(\tau, s)$ is positive. Moreover, given a homomorphism $\alpha$ from $A$
to a real closed field with involution $R$ inducing $\sigma$ we have
$n(\tau, s) = n(\tau, \alpha)$.

Proof Returning to the proof of the transfer formula we see that all
spaces $s_i^*\langle 1 \rangle$, $1 \le i \le g$, are positive definite over $R$. Thus
$r = g$. Furthermore, the ideals $N_i$ must be zero since otherwise the highest power
$N_i^k$ of $N_i$ that is nonzero would be an isotropic subspace of
$s_i^*\langle 1 \rangle$. Thus $s_i^*\langle 1 \rangle$ has rank one and
$s_\tau^*\langle 1 \rangle$ has rank $n(\tau, \alpha)$. We obtain

$$n(\tau, s) = \rho\, s_\tau^*\langle 1 \rangle = n(\tau, \alpha).$$

We still have to show $n(\tau, s) > 0$ for a given extension $\tau$ of $\sigma$ to $B$.
We choose a homomorphism $\beta$ from $B$ into a real closed field with involution $R$
inducing $\tau$, and we put $\alpha := \beta \circ \varphi$. Then certainly
$n(\tau, \alpha) > 0$ for this particular $\tau$, hence $n(\tau, s) > 0$. QED 

We now look at what happens to the coefficients $n(\tau, s)$ if we replace $s$ by
another Frobenius form $s'$ with respect to $\varphi$. We have

$$s'(x) = s(bx)$$

where $b$ is an element of $B$ with $\bar{b} = b$ uniquely determined by $s$ and $s'$.
Since the hermitian form $s'(\bar{x}y)$ is nondegenerate, $b$ must be a unit of $B$.
Let $\langle b \rangle$ denote the hermitian space over $B$ consisting of the
$B$-module $B$ and the hermitian form $b\bar{x}y$. We denote the class of this space
in $W(B)$ again by $\langle b \rangle$, and the value of this class under a signature
$\tau$ on $B$ by $\tau(b)$. Clearly

$$s'^*(z) = s^*(\langle b \rangle z)$$

for $z$ in $W(B)$, hence

$$\sigma s'^*(z) = \sum_{\tau \mid \sigma} n(\tau, s)\,\tau(\langle b \rangle z)
= \sum_{\tau \mid \sigma} n(\tau, s)\,\tau(b)\,\tau(z).$$

Thus we have

$$n(\tau, s') = \tau(b)\,n(\tau, s) \tag{1.5}$$

for every signature $\tau$ on $B$. Notice that $\tau(b) = \pm 1$ since
$\langle b \rangle^2 = 1$.


† We write for brevity $E$ instead of the pair $(E, \Phi)$. 
2. Some Examples 

We first study quadratic extensions. Let $A$ be an arbitrary commutative
ring with involution, and let $a$ be a unit in $A$ with $\bar{a} = a$. We consider
the ring

$$B := A[T]/(T^2 - a).$$

We have $B = A \oplus At$ with the relation $t^2 = a$. We extend the involution $J_A$
to $B$ either by prescribing $\bar{t} = t$ (case I) or by prescribing $\bar{t} = -t$
(case II).

We introduce the $A$-linear form $s$ on $B$ with $s(1) = 1$, $s(t) = 0$. Then in
both cases $s(\bar{x}) = \overline{s(x)}$ for every $x$ in $B$. Using the basis $1, t$
of $B$ over $A$, we obtain

$$s^*\langle 1 \rangle \cong \langle 1, a \rangle \quad \text{(case I)},$$

$$s^*\langle 1 \rangle \cong \langle 1, -a \rangle \quad \text{(case II)}.$$

Thus $s$ is a Frobenius form on $B$ over $A$. 

Let $\sigma$ be a signature on $A$.

Proposition 2.1 (i) Assume $\bar{t} = t$. If $\sigma(a) = -1$, then $\sigma$ has no
extension to $B$. If $\sigma(a) = +1$, then $\sigma$ has precisely two extensions
$\tau_1, \tau_2$ to $B$, and $n(\tau_1, s) = n(\tau_2, s) = 1$.

(ii) Assume $\bar{t} = -t$. If $\sigma(a) = +1$, then $\sigma$ has no extension to $B$.
If $\sigma(a) = -1$, then $\sigma$ has a unique extension $\tau$ to $B$ and
$n(\tau, s) = 2$. 

Proof If $\bar{t} = t$, then $a = \bar{t}t$ is a norm in $B$, and a signature $\sigma$
on $A$ with $\sigma(a) = -1$ cannot be extended to $B$. If $\bar{t} = -t$, then
$-a = \bar{t}t$ is a norm in $B$, and a signature $\sigma$ with $\sigma(a) = +1$ cannot
be extended to $B$. In the remaining cases $s^*\langle 1 \rangle$ is positive definite
at $\sigma$. Thus by Proposition 1.4 every extension $\tau$ of $\sigma$ to $B$ has
coefficient $n(\tau, s) > 0$. Furthermore,

$$\sum_{\tau \mid \sigma} n(\tau, s) = \sigma s^*(1) = 2.$$

Thus one of the following two possibilities must occur:

(A) $\sigma$ has precisely two extensions $\tau_1, \tau_2$, and
$n(\tau_1, s) = n(\tau_2, s) = 1$.

(B) $\sigma$ has a unique extension $\tau$, and $n(\tau, s) = 2$.

Anyway, $\sigma$ has at least one extension to $B$, and we choose such an extension
$\tau_1$. We further choose a homomorphism $\beta_1$ from $B$ to a real closed field
with involution $R$ inducing $\tau_1$. We put $\alpha := \beta_1 \circ \varphi$, with
$\varphi$ the inclusion map from $A$ to $B$. Then $\alpha$ induces $\sigma$. Let
$\gamma$ denote the automorphism of $B$ over $A$ of order 2, defined by
$\gamma(t) = -t$. Notice that $\gamma$ is indeed compatible with the involution $J_B$
in both cases. Consider the homomorphism $\beta_2 := \beta_1 \circ \gamma$ from
$B$ to $R$. We have $\beta_2(t) \ne \beta_1(t)$ and thus $\beta_2$ is different from
$\beta_1$. Clearly, $\beta_1$ and $\beta_2$ are all homomorphisms from $B$ to $R$
extending $\alpha$. Let $\rho$ denote the unique signature of $R$. If $\bar{t} = t$,
then we have a hermitian space $\langle t \rangle$† over $B$ consisting of the
$B$-module $B$ and the hermitian form $t\bar{x}y$. We have

$$(\rho \circ \beta_{2*})(\langle t \rangle) = -(\rho \circ \beta_{1*})(\langle t \rangle).$$

Thus there exist two different signatures $\rho \circ \beta_{1*}$ and
$\rho \circ \beta_{2*}$ extending $\sigma$, and possibility (A) is realized. If
$\bar{t} = -t$, then

$$\beta_2 = J_R \circ \beta_1.$$

Now $J_R$ is an automorphism of the field with involution $R$ that induces the
identity on $W(R)$ (cf. [7, I, Lemma 3.11]). Thus
$\rho \circ \beta_{2*} = \rho \circ \beta_{1*} = \tau_1$, and we obtain
$n(\tau_1, \alpha) = 2$. According to Proposition 1.4, we have $n(\tau_1, s) = 2$,
and possibility (B) is realized.

† Cf. the notations at the end of Section 1. 
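
The bookkeeping in Proposition 2.1 can be checked against the transfer formula at $z = 1$. The following is a minimal sketch, assuming $A = \mathbb{Q}$ with trivial involution and its unique ordering, so that $\sigma(a)$ is simply the sign of $a$; this choice of $A$ and the helper names `sign`, `transfer_of_one`, and `coefficients` are illustrative assumptions, not part of the original text.

```python
from fractions import Fraction


def sign(q):
    """Sign of a nonzero rational, i.e. the value of <q> under the signature of Q."""
    return 1 if q > 0 else -1


def transfer_of_one(a, case):
    """sigma(s^*<1>) for B = Q[T]/(T^2 - a).

    s^*<1> is <1, a> in case I (t fixed by the involution)
    and <1, -a> in case II (t mapped to -t).
    """
    return 1 + sign(a) if case == "I" else 1 + sign(-a)


def coefficients(a, case):
    """The coefficients n(tau, s) over all extensions tau, as stated in Proposition 2.1."""
    if case == "I":
        return [1, 1] if sign(a) == 1 else []
    return [2] if sign(a) == -1 else []


for a in (Fraction(2), Fraction(-3), Fraction(1, 5), Fraction(-7, 4)):
    for case in ("I", "II"):
        # Transfer formula at z = 1: sigma(s^*(1)) is the sum of the n(tau, s).
        assert transfer_of_one(a, case) == sum(coefficients(a, case))
```

In each of the four sign and case combinations the signature of $s^*\langle 1 \rangle$ is either 2 or 0, matching the sum of the coefficients listed in the proposition.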

If $B = A[T]/(T^2 - a)$ but $a$ is not a unit in $A$, the situation may be very
different.

Example 2.2 Let $A$ be the localization of the polynomial ring
$\mathbb{R}[x_1, \dots, x_n]$ in $n \ge 2$ variables $x_i$ with respect to the maximal
ideal generated by $x_1, \dots, x_n$, and let $\mathfrak{m}$ denote the maximal ideal
$Ax_1 + \cdots + Ax_n$ of $A$. We choose the element

$$a := -(x_1^2 + \cdots + x_n^2)$$

in $\mathfrak{m}$ and study the extension

$$B = A \oplus At, \qquad t^2 = a,$$

of $A$, both $A$ and $B$ being equipped with the trivial involution. The $A$-linear
form $s$ on $B$ defined by $s(1) = 0$, $s(t) = 1$ is a Frobenius form over $A$. Indeed,

$$s^*\langle 1 \rangle \cong \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}.$$

The ideal $\mathfrak{M} := \mathfrak{m}B + Bt$ is the unique prime ideal of $B$ lying
over $\mathfrak{m}$, and

$$A/\mathfrak{m} = B/\mathfrak{M} = \mathbb{R}.$$

We consider the evident signatures attached to $\mathfrak{m}$ and $\mathfrak{M}$,

$$\sigma\colon W(A) \to W(A/\mathfrak{m}) \cong \mathbb{Z}, \qquad
\tau\colon W(B) \to W(B/\mathfrak{M}) \cong \mathbb{Z}.$$

Notice that $\tau$ extends $\sigma$. Since $A$ is regular, $\sigma$ can also be extended
to a signature of the quotient field $\mathbb{R}(x_1, \dots, x_n)$ of $A$, equipped with
the trivial involution, as has been shown by Craven, Rosenberg, and Ware [2]. Thus
there exists an injection $\alpha$ of $A$ into a real closed field with involution $R$
inducing $\sigma$. But this homomorphism $\alpha$ certainly does not extend to a
homomorphism $\beta$ from $B$ to $R$ since $\alpha(a)$ is a negative element of the
fixed field $R_0$ of $J_R$, and $\beta(t)$ would be an element $\lambda$ of $R_0$ with
$\lambda^2 = \alpha(a)$. In particular $n(\tau, s) = 0$.

Remark 2.3 From Theorem 3.1 in the next section it can be easily
deduced that $\tau$ is the only extension of $\sigma$ to $B$. 

We now shall study extensions of type $B = A[T]/(T^n)$. We have the following
general fact.

Lemma 2.4 Let $\varphi\colon A \to B$ be a homomorphism in the category of commutative
rings with involution. Let $\mathfrak{N}_A$ and $\mathfrak{N}_B$ denote the nil radicals
of $A$ and $B$, and $\bar{A}$, $\bar{B}$ denote the reductions $A/\mathfrak{N}_A$ and
$B/\mathfrak{N}_B$. Assume that $\varphi$ induces an isomorphism
$\bar{\varphi}\colon \bar{A} \to \bar{B}$. Then every signature of $A$ extends in a
unique way to a signature of $B$ with respect to $\varphi$.

Proof We have a commutative diagram

$$
\begin{array}{ccc}
A & \xrightarrow{\ \varphi\ } & B \\
{\scriptstyle \pi}\downarrow & & \downarrow{\scriptstyle \pi'} \\
\bar{A} & \xrightarrow{\ \bar{\varphi}\ } & \bar{B}
\end{array}
$$

with $\pi$ and $\pi'$ the canonical surjections. It suffices to prove the lemma for the
homomorphisms $\pi$ and $\pi'$. Then it will be evident for $\varphi$ too. Thus we
assume hereafter that $B = \bar{A}$ and $\varphi$ is the canonical surjection from $A$
onto $\bar{A}$. 

Let $\sigma$ be a signature on $A$. We choose a homomorphism $\alpha$ from $A$ into a
real closed field with involution $R$ inducing $\sigma$. We have a unique homomorphism
$\bar{\alpha}$ from $\bar{A}$ to $R$ with $\bar{\alpha} \circ \varphi = \alpha$. Let
$\tau$ denote the signature on $\bar{A}$ induced by $\bar{\alpha}$. Then $\tau$ extends
$\sigma$. The remaining assertion that $\tau$ is the unique extension of $\sigma$ now
follows from the fact that the map $\varphi_*$ from $W(A)$ to $W(\bar{A})$ is
surjective. Since I did not find a reference for this probably well-known fact
in full generality, I indicate a proof.

We show that for a given hermitian space $(U, h)$ over $\bar{A}$ there exists a
hermitian space $(E, \Phi)$ over $A$ whose reduction mod $\mathfrak{N}_A$ is isometric
to $(U, h)$. There exists a finitely generated projective $A$-module $E$, unique up to
isomorphism, such that $E/\mathfrak{N}_A E$ is isomorphic to $U$ (cf., e.g., Swan [13,
Theorem 2.26, p. 89]). Thus we assume a priori that $U$ is the reduction $\bar{E}$ of a
given finitely generated projective $A$-module $E$ modulo $\mathfrak{N}_A$. We now can
find another hermitian space $(V, h')$ over $\bar{A}$ such that $U \oplus V$ is a free
module over $\bar{A}$. Indeed, let $W$ be a module over $\bar{A}$ such that
$U \oplus W$ is a free $\bar{A}$-module of finite rank. Then take

$$(V, h') := (U, -h) \perp H(W)$$

with $H(W)$ the hyperbolic space $W \oplus W^*$ constructed from $W$.
($W^*$ = antidual module of $W$, $W \oplus W^*$ equipped with the obvious hermitian
form which is zero on $W \times W$ and on $W^* \times W^*$.) We again regard $V$ as
the reduction mod $\mathfrak{N}_A$ of a finitely generated projective $A$-module $F$.
The module $G := E \oplus F$ over $A$ is free. Let $g_1, \dots, g_n$ be a basis of $G$
over $A$, and let $\bar{g}_1, \dots, \bar{g}_n$ denote the corresponding basis of the
reduction $\bar{G} = U \oplus V$. We now lift the hermitian form $h \perp h'$ of
$\bar{G}$ to a hermitian form $\psi$ of $G$ by lifting the hermitian matrix of
$h \perp h'$ with respect to the basis $\bar{g}_1, \dots, \bar{g}_n$ to a
hermitian matrix over $A$ in an arbitrary way. Let $\Phi$ denote the restriction of
$\psi$ to $E$. Clearly, $\Phi$ reduces mod $\mathfrak{N}_A$ to the nondegenerate form
$h$ on $U$, hence $\Phi$ itself is nondegenerate. Thus $(E, \Phi)$ is a hermitian space
over $A$ that has mod $\mathfrak{N}_A$ the reduction $(U, h)$.

Remark If $A$ contains an element $\mu$ with $\mu + \bar{\mu} = 1$ (e.g., 2 is a unit
in $A$), then by well-known arguments any two hermitian spaces over $A$ that
have isometric reductions mod $\mathfrak{N}_A$ are themselves isometric. Thus the
canonical map from $W(A)$ to $W(\bar{A})$ is an isomorphism. We do not need this fact. 

We now consider an arbitrary commutative ring $A$ with involution and
study the extension $B = A[T]/(T^n)$ for some $n \ge 2$. We have

$$B = A \oplus At \oplus \cdots \oplus At^{n-1}, \qquad t^n = 0,$$

and we extend $J_A$ to an involution $J_B$ on $B$ by prescribing $\bar{t} = t$.

Proposition 2.5 The $A$-linear form $s$ on $B$ defined by

$$s(1) = s(t) = \cdots = s(t^{n-2}) = 0, \qquad s(t^{n-1}) = 1$$

is a Frobenius form over $A$. Every signature $\sigma$ on $A$ has a unique extension
$\tau$ to $B$ (cf. the preceding Lemma 2.4). $n(\tau, s) = 1$ if $n$ is odd, and
$n(\tau, s) = 0$ if $n$ is even. 

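Proof We have indeed $s(\bar{x}) = \overline{s(x)}$ for $x$ in $B$. If $n = 2m + 1$,
the space $s^*\langle 1 \rangle$ has the orthogonal decomposition

$$s^*\langle 1 \rangle = (At^m) \perp \mathop{\perp}_{i=0}^{m-1} (At^i + At^{2m-i}).$$

Thus, using an obvious notation,

$$s^*\langle 1 \rangle \cong \langle 1 \rangle \perp m \times \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}.$$

If $n = 2m$, we have the orthogonal decomposition

$$s^*\langle 1 \rangle = \mathop{\perp}_{i=0}^{m-1} (At^i + At^{2m-1-i})$$

and

$$s^*\langle 1 \rangle \cong m \times \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}.$$

Thus on the Witt ring level $s^*(1) = 1$ if $n$ is odd and $s^*(1) = 0$ if $n$ is even.
Let now $\sigma$ be a signature on $A$ and $\tau$ be the unique extension of $\sigma$ to
$B$. By the transfer formula

$$n(\tau, s) = \sigma s^*(1) = \begin{cases} 1, & n \text{ odd}, \\ 0, & n \text{ even}. \end{cases}$$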
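
The Witt-class computation above can be checked numerically in the simplest case. The following is a minimal sketch, assuming $A = \mathbb{Q}$ with trivial involution (an assumption made only for illustration; the proposition holds for arbitrary $A$). The helper names `gram_matrix` and `signature` are hypothetical. The code builds the matrix of $s(xy)$ in the basis $1, t, \dots, t^{n-1}$ and diagonalizes it by congruence over the rationals.

```python
from fractions import Fraction


def gram_matrix(n):
    """Matrix of s(xy) on Q[T]/(T^n) in the basis 1, t, ..., t^(n-1).

    s(t^k) = 1 if k == n - 1 and 0 otherwise, and t^k = 0 for k >= n,
    so the (i, j) entry s(t^i * t^j) is 1 exactly when i + j == n - 1.
    """
    return [[Fraction(1 if i + j == n - 1 else 0) for j in range(n)]
            for i in range(n)]


def signature(matrix):
    """Sylvester signature of a symmetric rational matrix, by congruence."""
    m = [row[:] for row in matrix]
    n = len(m)

    def add(k, j, f):
        # row_k += f * row_j, then col_k += f * col_j (a congruence step).
        for c in range(n):
            m[k][c] += f * m[j][c]
        for r in range(n):
            m[r][k] += f * m[r][j]

    def swap(k, j):
        # Swap rows k and j, then columns k and j.
        m[k], m[j] = m[j], m[k]
        for row in m:
            row[k], row[j] = row[j], row[k]

    for k in range(n):
        if m[k][k] == 0:
            diag = [j for j in range(k + 1, n) if m[j][j] != 0]
            off = [j for j in range(k + 1, n) if m[k][j] != 0]
            if diag:
                swap(k, diag[0])
            elif off:
                add(k, off[0], Fraction(1))  # new pivot equals 2 * m[k][j]
            else:
                continue  # row k is zero in the remaining block
        for i in range(k + 1, n):
            add(i, k, -m[i][k] / m[k][k])
    positives = sum(1 for k in range(n) if m[k][k] > 0)
    negatives = sum(1 for k in range(n) if m[k][k] < 0)
    return positives - negatives
```

For example, `[signature(gram_matrix(n)) for n in range(1, 8)]` alternates between 1 and 0, in agreement with the values of $n(\tau, s)$ in Proposition 2.5.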


3. Integral Extensions of Semilocal Rings 

We now assume that our ring with involution $A$ is semilocal, i.e., $|A|$ has
only finitely many maximal ideals. In this case there exists for every signature
$\sigma$ on $A$ a prime ideal $\mathfrak{p}(\sigma)$ of $A$ intimately related to
$\sigma$, whose definition and relevant properties I want to recall (cf. [7, I, §4 and
Appendix B]).

Let $A_0$ denote the fixed ring of $J_A$, equipped with the trivial involution.
We introduce for a given signature $\sigma$ on $A$ the set $\Gamma(\sigma)$ consisting
of all units $a$ of $A_0$ with $\sigma(a) = +1$, and the set $Q(\sigma)$ consisting of
all finite sums

$$N(\lambda_1)a_1 + \cdots + N(\lambda_r)a_r$$

with $a_i$ in $\Gamma(\sigma)$ and "norms" $N(\lambda_i) = \bar{\lambda}_i\lambda_i$ of
elements $\lambda_i$ of $A$ such that the ideal generated by
$\lambda_1, \dots, \lambda_r$ is the whole of $A$. Clearly $Q(\sigma)$ is a
multiplicative subsemigroup of $A_0$. Let $-Q(\sigma)$ denote the set of all elements
$-x$ with $x$ in $Q(\sigma)$ and $\mathfrak{p}(\sigma)_0$ the complement of
$Q(\sigma) \cup (-Q(\sigma))$ in $A_0$. The following facts have been proved in [7] and
in a more special situation already by Kanzaki and Kitamura [6].

P1 $A_0$ is the disjoint union of $Q(\sigma)$, $-Q(\sigma)$, and $\mathfrak{p}(\sigma)_0$.

P2 $\mathfrak{p}(\sigma)_0$ is a prime ideal of $A_0$.

P3 The set of all $x$ in $A$ with $N(x)$ lying in $\mathfrak{p}(\sigma)_0$ is a prime
ideal $\mathfrak{p}(\sigma)$ of $A$, obviously stable under $J_A$, and this is the only
prime ideal of $A$ lying over $\mathfrak{p}(\sigma)_0$.

P4 There exists a unique signature $\bar{\sigma}$ on the quotient field
$A(\mathfrak{p}(\sigma))$ of the ring with involution $A/\mathfrak{p}(\sigma)$ extending
$\sigma$ with respect to the natural map from $A$ to $A(\mathfrak{p}(\sigma))$.

P5 $\mathfrak{p}(\sigma)$ contains every other prime ideal $\mathfrak{q}$ of $A$ such
that $\mathfrak{q}$ is stable under the involution of $A$ and $\sigma$ extends to some
signature on $A(\mathfrak{q})$. 
Let $\sigma_0$ denote the restriction of $\sigma$ to $A_0$. Since the natural map from
$W(A_0)$ to $W(A)$ is surjective, $\sigma$ is the only extension of $\sigma_0$ to $A$.
Clearly $\Gamma(\sigma_0)$ coincides with $\Gamma(\sigma)$.

P6 $Q(\sigma_0)$ coincides with $Q(\sigma)$ and $\mathfrak{p}(\sigma_0)$ coincides with
$\mathfrak{p}(\sigma)_0$. The natural map from $A_0(\mathfrak{p}(\sigma)_0)$ to
$A(\mathfrak{p}(\sigma))$ identifies $A_0(\mathfrak{p}(\sigma)_0)$ with the fixed field
$A(\mathfrak{p}(\sigma))_0$ of the involution of $A(\mathfrak{p}(\sigma))$. The signature
$\bar{\sigma}$ is the unique extension of $\bar{\sigma}_0$ to $A(\mathfrak{p}(\sigma))$.

P7 $\Gamma(\bar{\sigma}_0)$ is the set of all fractions $\bar{u}\bar{v}^{-1}$ with
$\bar{u}, \bar{v}$ the images in $A_0/\mathfrak{p}(\sigma)_0$ of elements $u, v$ of
$Q(\sigma)$. Notice that $\Gamma(\bar{\sigma}_0)$ is just the set of positive elements
of the ordering of $A_0(\mathfrak{p}(\sigma)_0)$ corresponding to $\bar{\sigma}_0$.

P8 Every element $s$ of $Q(\sigma)$ can be written in the form

$$s = a_1 + N(\lambda_2)a_2 + \cdots + N(\lambda_r)a_r$$

with $a_1, \dots, a_r$ in $\Gamma(\sigma)$ and $\lambda_2, \dots, \lambda_r$ in $A$.
(If $A_0$ has no residue class fields with fewer than four elements, then even
$s = a_1 + a_2$ with $a_1, a_2$ in $\Gamma(\sigma)$.)

P9 $\Gamma(\sigma)$ is the intersection of $Q(\sigma)$ and the set $A_0^*$ of units in
$A_0$.

P10 $Q(\sigma) + \mathfrak{p}(\sigma)_0 = Q(\sigma)$.

P11 For $x$ in $A$ but $x$ not in $\mathfrak{p}(\sigma)$, the norm $N(x)$ lies in
$Q(\sigma)$.

We call $\mathfrak{p}(\sigma)$ (resp. $\mathfrak{p}(\sigma)_0$) the prime ideal of $A$
(resp. of $A_0$) associated with $\sigma$. 

Let now $\varphi\colon A \to B$ be a homomorphism from $A$ to another semilocal ring
with involution $B$, and let $\varphi_0\colon A_0 \to B_0$ denote the restriction of
$\varphi$ to the fixed rings of the involutions. Let further $\tau$ be a signature on
$B$ and let $\sigma$ denote the restriction $\tau \circ \varphi_*$ of $\tau$ to $A$ with
respect to $\varphi$.

Theorem 3.1 Assume $\varphi$ is integral, i.e., every element of $B$ is integral over
the subring $\varphi(A)$. Then $\mathfrak{p}(\sigma)$ is the preimage of
$\mathfrak{p}(\tau)$ under $\varphi$, and $\mathfrak{p}(\sigma)_0$, resp. $Q(\sigma)$,
are the preimages of $\mathfrak{p}(\tau)_0$, resp. $Q(\tau)$, under $\varphi_0$. 

Proof $\mathfrak{p}(\tau)$ is the unique prime ideal of $B$ lying over
$\mathfrak{p}(\tau)_0$, and $\mathfrak{p}(\sigma)$ is the unique prime ideal of $A$
lying over $\mathfrak{p}(\sigma)_0$. Thus it suffices to prove the statements about
$\mathfrak{p}(\sigma)_0 = \mathfrak{p}(\sigma_0)$ and $Q(\sigma) = Q(\sigma_0)$.
Replacing $\varphi$ by $\varphi_0$, we assume without loss of generality that $A$ and
$B$ both have trivial involutions.

Without any assumption about $\varphi$ it is clear that $Q(\sigma)$ is contained in
$\varphi^{-1}Q(\tau)$ and $-Q(\sigma)$ is contained in $\varphi^{-1}(-Q(\tau))$.
Considering the complement of $Q(\sigma) \cup (-Q(\sigma))$ in $A$ we learn that
$\mathfrak{p}(\sigma)$ contains $\varphi^{-1}\mathfrak{p}(\tau)$. (This also follows from
the fact that $\sigma$ extends to a signature on $B(\mathfrak{p}(\tau))$ with respect to
the obvious map.) To finish the proof of the theorem it will suffice to show
that $Q(\sigma)$ coincides with $\varphi^{-1}Q(\tau)$. Indeed, considering again the
complement of $Q(\sigma) \cup (-Q(\sigma))$ in $A$, this will imply that also
$\mathfrak{p}(\sigma)$ coincides with $\varphi^{-1}\mathfrak{p}(\tau)$.

Suppose there exists an element $t$ in $\varphi^{-1}Q(\tau)$ that does not lie in
$Q(\sigma)$. Clearly, $t$ also does not lie in $-Q(\sigma)$ since this set is contained
in $\varphi^{-1}(-Q(\tau))$. Thus $t$ must lie in $\mathfrak{p}(\sigma)$. On the other
hand, we have an equation (cf. P8)

$$\varphi(t) = b_1 + \lambda_2^2 b_2 + \cdots + \lambda_r^2 b_r \tag{$*$}$$

with $b_1, \dots, b_r$ in $\Gamma(\tau)$ and $\lambda_2, \dots, \lambda_r$ in $B$.

We consider the field $L := B(\mathfrak{p}(\tau))$, equipped with the ordering
corresponding to the signature $\bar{\tau}$. For $b$ in $B$, we denote the natural image
in $L$ by $\bar{b}$, and for $a$ in $A$, we denote the natural image of $\varphi(a)$ in
$L$ also by $\bar{a}$ for conciseness. We now make the following observation. Let $a$ be
an arbitrary element of $A$. Then $1 - at$ lies in $Q(\sigma)$ since $t$ lies in
$\mathfrak{p}(\sigma)$ (cf. P10). Thus $1 - \varphi(at)$ lies in $Q(\tau)$, and the
element $1 - \bar{a}\bar{t}$ in $L$ is positive. Now $\bar{b}_1 > 0$ and, according to
($*$), $\bar{b}_1 \le \bar{t}$. Thus we learn that

$$\bar{a}\bar{b}_1 < 1$$

for every $a$ in $A$. Introducing the element $d := b_1^{-1}$ of $B$ we have

$$\bar{a} < \bar{d} \tag{$**$}$$

for every $a$ in $A$. But this is impossible since $B$ is integral over $A$. Indeed, we
have an equation

$$d^n + \varphi(a_1)d^{n-1} + \cdots + \varphi(a_n) = 0$$

with some $n \ge 1$ and elements $a_i$ of $A$. This implies the equation

$$\bar{d}^n + \bar{a}_1\bar{d}^{n-1} + \cdots + \bar{a}_n = 0. \tag{$***$}$$

Consider the element

$$c := 1 + \varepsilon_1 a_1 + \cdots + \varepsilon_n a_n$$

with $\varepsilon_i = +1$ if $\bar{a}_i \ge 0$ and $\varepsilon_i = -1$ if
$\bar{a}_i < 0$. Then in an obvious notation

$$\bar{c} = 1 + |\bar{a}_1| + \cdots + |\bar{a}_n|.$$

By a well-known lemma the equation ($***$) implies† $\bar{d} < \bar{c}$, which
contradicts ($**$). Thus an element $t$ as above cannot exist, and we have
$Q(\sigma) = \varphi^{-1}Q(\tau)$. This finishes the proof of Theorem 3.1. 
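
The "well-known lemma" used at the end of this proof bounds every root of a monic polynomial over an ordered field by $\max(1, |a_1| + \cdots + |a_n|)$. A minimal sketch over $\mathbb{Q}$ follows; the choice of $\mathbb{Q}$, the sample roots, and the helper names `monic_from_roots` and `root_bound` are illustrative assumptions, not part of the original argument.

```python
from fractions import Fraction


def monic_from_roots(roots):
    """Coefficients [1, a_1, ..., a_n] of prod (x - r), highest degree first."""
    coeffs = [Fraction(1)]
    for r in roots:
        # Multiply the current polynomial by (x - r).
        shifted = coeffs + [Fraction(0)]
        scaled = [Fraction(0)] + [-r * c for c in coeffs]
        coeffs = [p + q for p, q in zip(shifted, scaled)]
    return coeffs


def root_bound(coeffs):
    """max(1, |a_1| + ... + |a_n|) for a monic polynomial [1, a_1, ..., a_n]."""
    return max(Fraction(1), sum(abs(c) for c in coeffs[1:]))


roots = [Fraction(3), Fraction(-2), Fraction(1, 2)]
coeffs = monic_from_roots(roots)  # x^3 - (3/2) x^2 - (11/2) x + 3
bound = root_bound(coeffs)
assert all(abs(r) <= bound for r in roots)
```

In the proof the same bound is applied to the image $\bar{d}$ in the ordered field $L$, which gives $\bar{d} < \bar{c}$.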

This theorem has the following consequence which is important in view of 
the preceding sections. 

Corollary 3.2 We assume again that $\varphi\colon A \to B$ is an integral homomorphism
between semilocal rings, that $\sigma$ is a signature on $A$, and $\tau$ an extension of
$\sigma$ to $B$. Let $\alpha$ be a homomorphism from $A$ to a real closed field with
involution $R$ that induces $\sigma$ and has kernel $\mathfrak{p}(\sigma)$. (Such a
homomorphism clearly exists.) If the field $L := B(\mathfrak{p}(\tau))$ has trivial
involution or if the field $K := A(\mathfrak{p}(\sigma))$ has nontrivial involution,
then there exists a unique homomorphism $\beta$ from $B$ to $R$ that extends $\alpha$,
i.e., $\alpha = \beta \circ \varphi$, and induces $\tau$. In the remaining case that $K$
has trivial involution but $L$ has nontrivial involution there exist precisely two such
homomorphisms $\beta_1$ and $\beta_2$, and $\beta_2 = J_R \circ \beta_1$.

† In fact, $\bar{d} \le \operatorname{Max}(1, |\bar{a}_1| + \cdots + |\bar{a}_n|)$. 

Proof $\alpha$ factors through a unique homomorphism $\bar\alpha$ from $K$ to $R$, and $\bar\alpha$ induces the signature $\bar\sigma$ on $K$. The field $L$ is an algebraic extension of $K$ and the signature $\bar\tau$ of $L$ extends $\bar\sigma$. We want to investigate how many homomorphisms $\beta$ from $L$ to $R$ exist that extend $\bar\alpha$ and induce $\bar\tau$. This is possible by the classical Artin-Schreier theory of orderings and real closures.

Assume first that $L$, hence also $K$, has trivial involution. Then $\bar\alpha$ maps $K$ into the field $R_0$, which is real closed in the classical sense, and also $\beta$ must have its image in $R_0$. By Artin-Schreier's theory there exists a unique homomorphism $\beta$ from $L$ to $R_0$ that extends $\bar\alpha$ and induces $\bar\tau$.

Assume now that $K$, hence also $L$, has nontrivial involution. Let $\bar\alpha_0: K_0 \to R_0$ be the restriction of $\bar\alpha$ to $K_0$. Again we have a unique homomorphism $\beta_0$ from $L_0$ to $R$ that extends $\bar\alpha_0$ and induces the signature $\bar\tau_0$ on $L_0$ ($\bar\tau_0$ = restriction of $\bar\tau$ = signature induced on $B_0(\mathfrak p(\tau_0))$ by $\tau_0$). Now $L$ is the tensor product of $L_0$ and $K$ over $K_0$. Thus there exists a unique homomorphism $\beta$ from $L$ to $R$ that extends both $\bar\alpha$ and $\beta_0$. The signature induced on $L$ by $\beta$ extends $\bar\tau_0$ and thus coincides with $\bar\tau$. Clearly $\beta$ is the unique homomorphism from $L$ to $R$ that extends $\bar\alpha$ and induces $\bar\tau$.

Assume finally that $K$ has trivial involution and $L$ has nontrivial involution. We have a unique homomorphism $\beta_0$ from $L_0$ to $R$ that extends $\bar\alpha$ and induces $\bar\tau_0$. There exist precisely two homomorphisms $\beta_1$ and $\beta_2$ from $L$ to $R$ that extend $\beta_0$, since $L$ is a quadratic extension of $L_0$ and $|R|$ is algebraically closed, and $\beta_2 = J_R \circ \beta_1$. Both homomorphisms induce signatures on $L$ that extend $\bar\tau_0$. Thus these signatures both coincide with $\bar\tau$.

In this way we have found the homomorphisms $\beta$ from $L$ to $R$ that induce $\bar\tau$ and extend $\bar\alpha$ in all cases. Composing these homomorphisms $\beta$ with the natural map from $B$ to $L$ we obtain all homomorphisms from $B$ to $R$ that have kernel $\mathfrak p(\tau)$, induce $\tau$ on $B$, and extend $\alpha$.

To complete the proof of our corollary it remains to be shown that any homomorphism $\beta$ from $B$ to $R$ that extends $\alpha$ and induces $\tau$ has kernel $\mathfrak p(\tau)$. Let $\mathfrak q$ denote the kernel of $\beta$. Then $\mathfrak q$ lies over the prime ideal $\mathfrak p(\sigma)$ of $A$. Furthermore, we obtain from $\beta$ a signature on $B(\mathfrak q)$ that extends $\tau$. Thus $\mathfrak q \subset \mathfrak p(\tau)$. Since also $\mathfrak p(\tau)$ lies over $\mathfrak p(\sigma)$, the prime ideals $\mathfrak p(\tau)$ and $\mathfrak q$ must be equal, according to a well-known theorem of Cohen-Seidenberg about prime ideals of integral extensions (e.g., [1, §2, 1, Corollary 2]).

From this Corollary 3.2, now proved, we obtain immediately 

Corollary 3.3 Let $\varphi: A \to B$ be a finite homomorphism between semilocal rings with involution, and assume that $B$ can be generated as a module over $A$ by $n$ elements. Then an arbitrary signature on $A$ has at most $n$ extensions to $B$.

Indeed, just observe that in the situation of Corollary 3.2 the homomorphisms from $B$ to $R$ that extend $\alpha$ correspond one to one with the homomorphisms from $B \otimes_A A(\mathfrak p)$ to $R$ that extend the homomorphism $\bar\alpha$ from $A(\mathfrak p)$ to $R$ induced by $\alpha$. The algebra $B \otimes_A A(\mathfrak p)$ has rank at most $n$ over $A(\mathfrak p)$.

If we study the extensions of a signature $\sigma$ on $A$ with respect to our integral homomorphism $\varphi: A \to B$, we may pass from $A$ and $B$ to their localizations with respect to $\mathfrak p(\sigma)$. This is a consequence of the following localization lemma.

Lemma 3.4 Let $\sigma$ be a signature of the semilocal ring with involution $A$, and let $S$ be a multiplicative subset (= subsemigroup) of $Q(\sigma)$. Assume that the localization $S^{-1}A$ of $A$ with respect to $S$ is again semilocal. Then there exists a unique signature $\tilde\sigma$ of $S^{-1}A$ extending $\sigma$ with respect to the natural map from $A$ to $S^{-1}A$. We have $\mathfrak p(\tilde\sigma) = S^{-1}\mathfrak p(\sigma)$, $\mathfrak p(\tilde\sigma)_0 = S^{-1}\mathfrak p(\sigma)_0$, and $Q(\tilde\sigma) = S^{-1}Q(\sigma)$.

Proof Since the natural map from $A$ to $A(\mathfrak p(\sigma))$ factors through the map from $A$ to $S^{-1}A$, there exist extensions of $\sigma$ to $S^{-1}A$. Let $\tau$ be one of them. Clearly, $S^{-1}T(\sigma)$ is contained in $T(\tau)$, hence $S^{-1}Q(\sigma)$ is contained in $Q(\tau)$. Let $u$ be a unit of the fixed ring $S^{-1}A_0$ of the involution of $S^{-1}A$. The set $S^{-1}A_0$ is the disjoint union of the prime ideal $S^{-1}\mathfrak p(\sigma)_0$ and the sets $\pm S^{-1}Q(\sigma)$. Thus $u$ lies in one of the sets $\pm S^{-1}Q(\sigma)$. According to Property P9, applied to $\tau$, we have $\tau(u) = +1$ if $u$ lies in $S^{-1}Q(\sigma)$, and $\tau(u) = -1$ if $u$ lies in $-S^{-1}Q(\sigma)$. Since $\tau$ is determined by the values on the hermitian spaces of rank one [Knebusch et al. 10, Proposition 2.16], we see that $\tau$ is the only signature of $S^{-1}A$ extending $\sigma$.

We have a natural map from $S^{-1}A$ to $A(\mathfrak p(\sigma))$ that identifies the field $A(\mathfrak p(\sigma))$ with the residue class field of $S^{-1}A$ with respect to the prime ideal $S^{-1}\mathfrak p(\sigma)$. The restriction of the signature $\bar\sigma$ on $A(\mathfrak p(\sigma))$ to $S^{-1}A$ is an extension of $\sigma$ and thus coincides with $\tau$. This implies that $S^{-1}\mathfrak p(\sigma)$ is contained in $\mathfrak p(\tau)$. We now know that $S^{-1}\mathfrak p(\sigma)_0$ is contained in $\mathfrak p(\tau)_0$ and from above that $S^{-1}Q(\sigma)$ is contained in $Q(\tau)$. Since $S^{-1}A_0$ is the disjoint union of the sets $S^{-1}\mathfrak p(\sigma)_0$, $\pm S^{-1}Q(\sigma)$, and also the disjoint union of the sets $\mathfrak p(\tau)_0$, $\pm Q(\tau)$, we have

$$S^{-1}\mathfrak p(\sigma)_0 = \mathfrak p(\tau)_0, \qquad S^{-1}Q(\sigma) = Q(\tau).$$

It remains to be shown that $S^{-1}\mathfrak p(\sigma)$ coincides with $\mathfrak p(\tau)$. Let $x$ be an element of $S^{-1}A$ that does not lie in $S^{-1}\mathfrak p(\sigma)$. Then $x = s^{-1}y$ with $s$ in $S$ and $y$ in $A$ but $y$ not in $\mathfrak p(\sigma)$. According to Property P11, the norm $N(y)$ lies in $Q(\sigma)$, hence $N(x)$ lies in $S^{-1}Q(\sigma)$, which coincides with $Q(\tau)$. Thus, again by P11, $x$ does not lie in $\mathfrak p(\tau)$, and we see that the subset $S^{-1}\mathfrak p(\sigma)$ of $\mathfrak p(\tau)$ actually coincides with $\mathfrak p(\tau)$. This finishes the proof of Lemma 3.4.

Proposition 3.5 (i) Let $\varphi: A \to B$ be a finite homomorphism between semilocal rings with involution. Let $\sigma$ be a signature on $A$ and let $\mathfrak p$ denote the associated prime ideal $\mathfrak p(\sigma)$. Then $\sigma$ has a unique extension $\tilde\sigma$ to $A_{\mathfrak p}$, and every extension $\tau$ of $\sigma$ to $B$ has a unique extension $\tilde\tau$ to $B_{\mathfrak p}$. Thus we have a one-to-one correspondence between the extensions of $\sigma$ to $B$ and the extensions of $\tilde\sigma$ to $B_{\mathfrak p}$.

(ii) The set $Q(\tilde\tau)$ consists of the elements $uv^{-1}$ with $u$ in $Q(\tau)$ and $v$ in $Q(\sigma)$, and $\mathfrak p(\tilde\tau)$ is the localization $\mathfrak p(\tau)B_{\mathfrak p}$ of $\mathfrak p(\tau)$.

(iii) Assume that in addition $\varphi$ is a Frobenius extension and that $s$ is a Frobenius form on $B$ over $A$. Let $\tilde s: B_{\mathfrak p} \to A_{\mathfrak p}$ denote the induced Frobenius form on $B_{\mathfrak p}$ over $A_{\mathfrak p}$. Then $n(\tilde\tau, \tilde s) = n(\tau, s)$ for every extension $\tau$ of $\sigma$ to $B$.

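Proof Parts (i) and (ii) of the proposition follow from the preceding lemma, applied to both $A$ and $B$ and the multiplicative set $S := Q(\sigma)$. Indeed, by Property P11 we have $S^{-1}A = A_{\mathfrak p}$ and $S^{-1}B = B_{\mathfrak p}$. Notice that $S$ is contained in $Q(\tau)$ for every extension $\tau$ of $\sigma$ to $B$.

Assume now that $s: B \to A$ is a Frobenius form over $A$. For $y$ in $W(A)$, we denote the image in $W(A_{\mathfrak p})$ by $\tilde y$ and for $z$ in $W(B)$ the image in $W(B_{\mathfrak p})$ by $\tilde z$. We have

$$\tilde\sigma\,\tilde s_*(\tilde z) = \tilde\sigma\bigl((s_*(z))^{\sim}\bigr) = \sigma(s_*(z)) = \sum_{\tau \mid \sigma} n(\tau, s)\,\tilde\tau(\tilde z) = \sum_{\tilde\tau \mid \tilde\sigma} n(\tilde\tau, \tilde s)\,\tilde\tau(\tilde z).$$

Thus indeed $n(\tilde\tau, \tilde s) = n(\tau, s)$ for all signatures $\tau$ on $B$ extending $\sigma$.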

Example 3.6 Assume in the situation of part (iii) of the preceding proposition that $B_{\mathfrak p}$ is finite etale over $A_{\mathfrak p}$. Then $n(\tau, s) \ne 0$ for every signature $\tau$ on $B$ extending $\sigma$. More precisely, $|n(\tau, s)| = 2$ if $A(\mathfrak p(\sigma))$ has trivial involution and $B(\mathfrak p(\tau))$ has nontrivial involution, and $|n(\tau, s)| = 1$ otherwise.

This follows from our Proposition 3.5 using (1.5) and the determination of the multiplicities $n(\tau)$ for finite etale extensions of semilocal rings in [7, II, Proposition 8.5]. (In [7] it is assumed that $A$ and $B$ are connected. This restriction can be removed easily in our situation.)

Our theory of prime ideals associated with signatures on semilocal rings and the consequences drawn from this theory in the present section immediately generalize to “weakly semilocal rings” as considered in [7]. I call a ring $A$ with involution weakly semilocal if $A$ contains a semilocal subring $A'$ (stable under the involution) such that $A$ is integral over $A'$. Then $A$ is the limit of an inductive system of semilocal rings with involution with finite transition morphisms. Weakly semilocal rings with involution are a more natural category for the present section than semilocal rings with involution, since an integral extension of a weakly semilocal ring is again weakly semilocal.


4. Frobenius Extensions with One Generator 

Let A be a commutative ring with involution. 

Proposition 4.1 Assume $B$ is a Frobenius extension of $A$ generated by one element $\vartheta$; $B = A[\vartheta]$. Let $\sigma$ be a signature on $A$ and $\alpha$ a homomorphism from $A$ to a real closed field with involution $R$ inducing $\sigma$. Then for any extension $\tau$ of $\sigma$ to $B$ and any Frobenius form $s$ on $B$ over $A$, we have

$$|n(\tau, s)| \le n(\tau, \alpha)$$

with $n(\tau, \alpha)$ denoting the number of homomorphisms $\beta$ from $B$ to $R$ that extend $\alpha$ and induce $\tau$.

This is an improvement of a previous inequality (1.4). To prove the proposition we run through the proof of Theorem 1.1, and we see that it suffices to show that for the Frobenius extensions $B_i$ of $R$ ($1 \le i \le r$) occurring there we have $\rho\,s_{i*}(1) = 0$ or $= \pm 1$. Now every $B_i$ is generated over $R$ by one element. Thus our proposition is evident if we verify the following lemma.

Lemma 4.2 Let $B$ be a finite extension of a real closed field with involution $R$, and assume that $B$ is generated over $R$ by one element. Then $B$ is a Frobenius extension of $R$. For any Frobenius form $s$ on $B$ we have $s_*(1) = \pm 1$ if $[B : R]$ is odd and $s_*(1) = 0$ if $[B : R]$ is even.

Proof $B = R \oplus N$ with $N$ the nil radical of $B$. We assume $N \ne 0$. Let $\vartheta$ be a generator of $B$ over $R$. We have $\vartheta = c + u$ with $c$ in $R$, $u$ in $N$, and $u$ is again a generator of $B$. Let $n$ denote the smallest natural number with $u^n = 0$ ($n \ge 2$). Then $1, u, \ldots, u^{n-1}$ is a basis of $B$ over the field $R$. Clearly

$$\bar u = u(\lambda + v)$$

with $\lambda$ in $R$, $\lambda \ne 0$, and $v$ in $N$. Denoting the unit $\lambda + v$ by $\varepsilon$ we have $\varepsilon\bar\varepsilon = 1$ since $\bar{\bar u} = u$. There exists an element $\mu$ in $R$ such that $\mu + \lambda\bar\mu \ne 0$. Then $\beta := \mu + \varepsilon\bar\mu$ is a unit of $B$, and $\beta = \varepsilon\bar\beta$. (This is the classical procedure for solving the equation $\varepsilon = \beta\bar\beta^{-1}$.) Introducing the element $t := \beta u$, we have $\bar t = t$, and $1, t, \ldots, t^{n-1}$ is again a basis of $B$ over $R$. We introduce the $R$-linear form $s$ on $B$ with

$$s(1) = \cdots = s(t^{n-2}) = 0, \qquad s(t^{n-1}) = 1.$$

According to Proposition 2.5 this form is a Frobenius form and $s_*(1) = 1$ if $n$ is odd, $s_*(1) = 0$ if $n$ is even. If $s'$ is another Frobenius form on $B$, then $s'_*(1) = \pm s_*(1)$ (cf. end of Section 1). This finishes the proof of the lemma.

As a relevant example we consider an extension 

B = A[T]/f(T) 

with $f(T)$ a normed polynomial in $A_0[T]$ of degree $n \ge 2$. Let $t$ denote the image of $T$ in $B$. Then $1, t, \ldots, t^{n-1}$ is a free basis of the $A$-module $B$. We extend the involution $J_A$ to $B$ prescribing $\bar t = t$. We consider the $A$-linear form $s$ on $B$ defined by

$$s(1) = s(t) = \cdots = s(t^{n-2}) = 0, \qquad s(t^{n-1}) = 1.$$

The hermitian matrix of the hermitian form $s(x\bar y)$ with respect to our basis $1, t, \ldots, t^{n-1}$ has the shape

$$\begin{pmatrix} 0 & \cdots & 0 & 1 \\ \vdots & & 1 & * \\ 0 & 1 & & \vdots \\ 1 & * & \cdots & * \end{pmatrix}$$

(zeros above the antidiagonal, ones on it) and thus is nonsingular, hence $s$ is a Frobenius form.
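The shape of this matrix can be checked on a small case. The following Python sketch is only an illustration under assumptions that are ours and not the author's: it takes $A = \mathbf{Z}$ with trivial involution and the hypothetical polynomial $f(T) = T^3 - 2T - 5$, builds the matrix $(s(t^i t^j))$, and computes its determinant.

```python
def powers_of_t(low_coeffs):
    """Coordinates of 1, t, ..., t^(2n-2) in the basis 1, t, ..., t^(n-1) of Z[T]/(f).

    f is monic: f(T) = T^n + low_coeffs[n-1]*T^(n-1) + ... + low_coeffs[0].
    """
    n = len(low_coeffs)
    v = [1] + [0] * (n - 1)          # coordinates of t^0
    out = []
    for _ in range(2 * n - 1):
        out.append(v)
        top = v[-1]                  # coefficient of t^(n-1)
        shifted = [0] + v[:-1]       # multiply by t, dropping the t^n term ...
        # ... and put it back using t^n = -(low_coeffs[0] + ... + low_coeffs[n-1] t^(n-1))
        v = [shifted[i] - top * low_coeffs[i] for i in range(n)]
    return out


def gram_matrix(low_coeffs):
    """Matrix (s(t^i * t^j)), where s reads off the coefficient of t^(n-1)."""
    n = len(low_coeffs)
    p = powers_of_t(low_coeffs)
    return [[p[i + j][n - 1] for j in range(n)] for i in range(n)]


def det(m):
    """Determinant by Laplace expansion along the first row (fine for small n)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


G = gram_matrix([-5, -2, 0])   # hypothetical example f(T) = T^3 - 2T - 5
for row in G:
    print(row)
print("det =", det(G))
```

The determinant is a unit because the entries above the antidiagonal vanish and the antidiagonal entries are all 1, whatever the entries below it are.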

We now assume in addition that $A$ is semilocal. Let $\sigma$ be a signature on $A$, and let $\mathfrak p$ respectively $\mathfrak p_0$ denote the prime ideal of $A$, resp. $A_0$, associated with $\sigma$. Let further $K$ denote the residue class field $A(\mathfrak p)$, and as always let $K_0$ denote the fixed field of the involution $J_K$, which coincides with $A_0(\mathfrak p_0)$. The image $\bar f(T)$ of our polynomial $f(T) \in A_0[T]$ in $K_0[T]$ has a decomposition

$$\bar f(T) = \prod_{i=1}^{s} p_i(T)^{e_i}$$

with pairwise different normed irreducible polynomials $p_i(T)$ over $K_0$. Let $\tau$ be a signature on $B$ extending $\sigma$. Then $\mathfrak p(\tau)_0$ is a prime ideal of $B_0$ lying over $\mathfrak p_0$ and thus

$$\mathfrak p(\tau)_0 = \mathfrak p_0 B_0 + P_j(T)B_0$$

with $j$ uniquely determined by $\tau$, $1 \le j \le s$, and $P_j(T)$ a preimage of $p_j(T)$ in $A_0[T]$. Since $\mathfrak p(\tau)$ is the unique prime ideal of $B$ lying over $\mathfrak p(\tau)_0$, the polynomial $p_j(T)$ remains irreducible over $K$, and

$$\mathfrak p(\tau) = \mathfrak p B + P_j(T)B.$$

We call $p_j(T)^{e_j}$ the factor of $\bar f(T)$ corresponding to $\tau$.

Proposition 4.3 Let $p_j(T)^{e_j}$ be the factor of $\bar f(T)$ corresponding to $\tau$. Then $n(\tau, s) = \pm 1$ if $e_j$ is odd and $n(\tau, s) = 0$ if $e_j$ is even.

Proof We choose a homomorphism $\alpha$ from $A$ to a real closed field with involution $R$ that has kernel $\mathfrak p(\sigma)$ and induces $\sigma$. There exist either one homomorphism $\beta$ or two homomorphisms $\beta$, $\beta'$ from $B$ to $R$ which extend $\alpha$, and in the second case $\beta' = J_R \circ \beta$ (cf. Corollary 3.2). Let $\lambda$ denote the image of $t$ under $\beta$ in both cases. Then $\bar\lambda = \lambda$, and $\lambda$ is a zero of the polynomial obtained from $p_j(T)$ by applying $\bar\alpha$ to the coefficients, since $\beta$ factors through $B(\mathfrak p(\tau))$. Going through the proof of Theorem 1.1, we see that with the notations introduced there

$$B(\tau) = R[T]/(T - \lambda)^{e_j}.$$

Our proposition now follows from Lemma 4.2 or already from Proposition 2.5.

It would be more difficult to determine the sign of $n(\tau, s)$ if $e_j$ is odd. We do not enter into this.

The original Gauss lemma is generalized in various ways: first, by replacing the integers $1, 2, \ldots, (p - 1)/2$ by any subset $a_i$ of $(p - 1)/2$ distinct integers less than a prime $p$ such that $a_j \ne p - a_i$ and having any desired property enjoyed by half the integers $< p$; next by replacing the integers less than $p$ by any symmetric (modulo $p$) coset of $k$th power residues of $p$; and finally by replacing the prime $p$ by a composite modulus $m$ and the integers less than $p$ by the totatives of $m$.

Application is made to criteria for the $k$th power character of 2, to the quartic reciprocity law, and to the character of a permutation.

Every textbook in elementary number theory contains a simple and 
elegant result known as Gauss’s lemma. It will be found in a little more 
general form in Hasse [3] and for our purposes can be stated as follows. 

Theorem 1 Let $p = 2n + 1$ be a prime. Let $S = \{1, 2, \ldots, p - 1\}$, and let $S_1$ and $S_\lambda$ be two half-sets of $S$ defined as follows:

$$S_1 = \{a'_1, a'_2, \ldots, a'_n\} \quad\text{with } a'_i \ne p - a'_j,\ a'_i \ne a'_j$$

and

$$S_\lambda = \{a''_1, a''_2, \ldots, a''_n\} \quad\text{where } a''_i \equiv \lambda a'_i \pmod p.$$

If $S_\lambda$ contains $\mu_\lambda$ elements that are not in $S_1$, then the Legendre symbol

$$(\lambda/p) = (-1)^{\mu_\lambda}.$$

Corollary 1 Let $a'_i = i$ for $i = 1, 2, \ldots, n$; then $\mu_\lambda$ is the number of elements in $S_\lambda$ that exceed $n$. This is the original Gauss lemma.

Corollary 2 Let $a'_i = 2i - 1$ for $i = 1, 2, \ldots, n$; then $\mu_\lambda$ is the number of even elements in $S_\lambda$. This special case will be found in Shanks [7].
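Theorem 1 and its two corollaries are easy to test numerically. The sketch below is an illustration only; the function names `legendre` and `mu` and the choice $p = 13$ are ours. It counts the elements of $\lambda S_1$ that fall outside $S_1$ and compares the resulting sign with the Legendre symbol computed from Euler's criterion.

```python
def legendre(a, p):
    """Legendre symbol (a/p) for an odd prime p, via Euler's criterion."""
    r = pow(a, (p - 1) // 2, p)
    return -1 if r == p - 1 else r


def mu(half_set, lam, p):
    """Number of elements of lam * half_set (mod p) that are not in half_set."""
    return sum(1 for a in half_set if (lam * a) % p not in half_set)


p = 13
n = (p - 1) // 2
gauss_half = set(range(1, n + 1))   # Corollary 1: a'_i = i
odd_half = set(range(1, p, 2))      # Corollary 2: a'_i = 2i - 1
for lam in range(1, p):
    assert (-1) ** mu(gauss_half, lam, p) == legendre(lam, p)
    assert (-1) ** mu(odd_half, lam, p) == legendre(lam, p)
print("Theorem 1 checked for p = 13 with both half-sets")
```

Any other half-set (one representative from each pair $a$, $p - a$) can be passed to `mu` in the same way.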

We next generalize Gauss’s lemma a little further as follows: 

Theorem 2 Let $P$ be any property enjoyed by half the elements $a_i$ but not by $p - a_i$ of $S = \{1, 2, \ldots, p - 1\}$ where $p$ is an odd prime, and let $\mu_1$ and $\mu_\lambda$ be respectively the number of elements in $S_1$ and $S_\lambda$ of Theorem 1 that do not possess property $P$, then

$$(\lambda/p) = (-1)^{\mu_1 + \mu_\lambda}.$$

If property $P$ is enjoyed by all the elements of $S_1$, then $\mu_1 = 0$ and $\mu_\lambda$ is the number of elements in $S_\lambda$ that are not in $S_1$, which is Theorem 1.

Before proving Theorem 2 and hence Theorem 1 as well, we first generalize it to subsets of the first $p - 1$ integers as follows:

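Theorem 3 Let $p = 2kn + 1$ be a prime, let $a_1 < a_2 < \cdots < a_{2n} < p$ and let

$$S = \{a_1, a_2, \ldots, a_{2n}\} \quad\text{with } a_{2n+1-i} = p - a_i$$

be any coset of the $k$th power residues of $p$ and let $\lambda$ be a $k$th power residue of $p$. Let $P$ be a property enjoyed by half the elements $a_i$ of $S$, but not by $p - a_i$. Let

$$S_1 = \{a'_1, a'_2, \ldots, a'_n\} \quad\text{with } a'_i \ne p - a'_j,\ a'_i \ne a'_j$$

and

$$S_\lambda = \{a''_1, a''_2, \ldots, a''_n\} \quad\text{where } a''_i \equiv \lambda a'_i \pmod p$$

be two half-sets of $S$, having respectively $\mu_1$ and $\mu_\lambda$ elements not possessing property $P$, then

$$(-1)^{\mu_1 + \mu_\lambda} = \begin{cases} (\lambda/p) & \text{if $k$ is odd,} \\ (\lambda/p)_{2k} & \text{if $k$ is even.} \end{cases}$$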

Proof Since $a'_i \ne p - a'_j$ it follows that $a''_i \not\equiv p - a''_j \pmod p$. If either $a'_i$ or $a''_i$ does not possess property $P$, then its complement modulo $p$ must possess property $P$, but there are $\mu_1$ such elements in $S_1$ and $\mu_\lambda$ in $S_\lambda$. Therefore, the product of all the elements in $S_\lambda$

$$\prod_{i=1}^{n} a''_i \equiv \lambda^n \prod_{i=1}^{n} a'_i \equiv \lambda^n (-1)^{\mu_1}\pi \equiv (-1)^{\mu_\lambda}\pi \pmod p$$

where $\pi$ is the product of all the elements in $S$ having property $P$. Since $p$ does not divide $\pi$, we have

$$(-1)^{\mu_1 + \mu_\lambda} \equiv \lambda^n = \lambda^{(p-1)/2k} \equiv (\lambda/p)_{2k} \pmod p.$$

Since $\lambda$ is a $k$th power residue, it will be a $2k$th power residue for $k$ odd if it is a quadratic residue. Hence the theorem follows. If $k = 1$ this becomes Theorem 2.

If all the elements of $S_1$ satisfy property $P$, then Theorem 3 is a generalization of Theorem 1 with $k = 1$.
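As a numerical illustration of Theorem 3, the sketch below compares $(-1)^{\mu_1 + \mu_\lambda}$ with $\lambda^{(p-1)/2k} \bmod p$, the quantity that appears in the proof. The helper names, the choice $p = 31$, $k = 3$, the property $P$ ("less than $p/2$") and the half-set (the odd elements of the coset) are our assumptions, picked only to have a half-set that does not coincide with the elements having property $P$.

```python
def kth_power_residues(p, k):
    """Set of nonzero kth power residues modulo the prime p."""
    return {pow(x, k, p) for x in range(1, p)}


def theorem3_sides(p, k, half, lam, has_P):
    """Return ((-1)^(mu_1 + mu_lam), lam^((p-1)/(2k)) mod p written as +1 or -1)."""
    mu_1 = sum(1 for a in half if not has_P(a))
    mu_lam = sum(1 for a in half if not has_P((lam * a) % p))
    left = (-1) ** (mu_1 + mu_lam)
    r = pow(lam, (p - 1) // (2 * k), p)
    right = -1 if r == p - 1 else r
    return left, right


p, k = 31, 3
residues = kth_power_residues(p, k)


def below_half(a):
    """Property P: the element is less than p/2."""
    return a < p / 2


for rep in (1, 3):                            # 3 is a primitive root of 31, so 3*residues is another coset
    coset = {(rep * r) % p for r in residues}
    half = {a for a in coset if a % 2 == 1}   # a and p - a have opposite parity, so this is a half-set
    for lam in residues:
        left, right = theorem3_sides(p, k, half, lam, below_half)
        assert left == right
print("Theorem 3 checked for p = 31, k = 3")
```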

We next give some applications of Theorem 3 in case 2 is a $k$th power residue.

Theorem 4 Let $p = 2kn + 1$ and suppose that 2 is a $k$th power residue of $p$, then the parity of the number of elements exceeding $p/2$ in a half-coset

$$S_1 = \{a'_1, a'_2, \ldots, a'_n\} \quad\text{with } a'_i \ne p - a'_j$$

of any coset of $k$th power residues modulo $p$ is the same as the parity of the number of odd elements in the half-coset if and only if 2 is a $2k$th power residue of $p$.

Proof Let $P$ be the property that an element is less than $p/2$ and let $P'$ be the property that an element is even. Let $\mu_1$ and $\mu_2$ and, correspondingly, $\mu'_1$ and $\mu'_2$ be the number of elements that do not possess property $P$ and $P'$ in $S_1$ and $S_2$, respectively. Then by Theorem 3 with $\lambda = 2$ we have

$$(2/p)_{2k} = (-1)^{\mu_1 + \mu_2} = (-1)^{\mu'_1 + \mu'_2}$$

or

$$\mu_1 + \mu_2 \equiv \mu'_1 + \mu'_2 \pmod 2.$$

But every element which exceeds $p/2$ in $S_1$ generates an odd element in $S_2$, therefore $\mu_1 = \mu'_2$ and hence $\mu_2 \equiv \mu'_1 \pmod 2$ so that

$$(-1)^{\mu_1 + \mu'_1} = \begin{cases} (2/p) & \text{when $k$ is odd,} \\ (2/p)_{2k} & \text{when $k$ is even,} \end{cases}$$

which is the theorem.

Corollary If $k$ is odd, and if 2 is a $k$th power residue, then $\mu_1 \equiv \mu'_1 \pmod 2$ if and only if $p \equiv \pm 1 \pmod 8$.

This corollary was first proposed as a problem for $k = 1$, for the set

$$S_1 = \{g^k, g^{2k}, \ldots, g^{nk}\}$$

where $g$ is a primitive root of $p = 2nk + 1$.

Example 1 Let $k = 3$, $p = 31$, $g = 3$, $a'_i = 3^{3i}$ ($i = 1, 2, 3, 4, 5$).

$$S_1 = \{27, 16, 29, 8, 30\}, \qquad \mu_1 = 4, \qquad \mu'_1 = 2.$$

As predicted by the corollary the $\mu$'s have the same parity.
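The numbers in Example 1 can be recomputed directly. In this short sketch (variable names are ours), `mu_1` counts the elements of the half-coset exceeding $p/2$ and `mu_1_prime` counts its odd elements, following the two properties used in the proof of Theorem 4.

```python
p, k, g = 31, 3, 3
n = (p - 1) // (2 * k)

# Half-coset generated by the powers g^k, g^(2k), ..., g^(nk) modulo p.
S1 = [pow(g, k * i, p) for i in range(1, n + 1)]

mu_1 = sum(1 for a in S1 if a > p / 2)          # elements exceeding p/2
mu_1_prime = sum(1 for a in S1 if a % 2 == 1)   # odd elements

print(S1, mu_1, mu_1_prime)
print("same parity:", (mu_1 - mu_1_prime) % 2 == 0, "| p mod 8 =", p % 8)
```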

Theorem 5 Let $p = 2nk + 1$, let $\nu_k$ be the number of elements $< p/4$ in the half-set $S_1$ of any coset of $k$th power residues, where

$$S_1 = \{a'_1, a'_2, \ldots, a'_n\} \quad\text{with } a'_i < p/2.$$

If 2 is a $k$th power residue, then

$$(-1)^{n + \nu_k} = \begin{cases} (2/p) & \text{if $k$ is odd,} \\ (2/p)_{2k} & \text{if $k$ is even.} \end{cases}$$

Proof Using Theorem 3 with $\lambda = 2$ we note that for every $a''_i \equiv 2a'_i \pmod p$ which exceeds $p/2$ there is an $a'_i$ that exceeds $p/4$, therefore $\nu_k = n - \mu_2$, and since $\mu_1 = 0$, the theorem follows.

Corollary If $k$ is odd, the number of $k$th power residues less than $p/4$ is of the same parity as $(p - 1)/2k$ if and only if $p \equiv \pm 1 \pmod 8$.

For $k = 1$, the theorem is trivial. For $k = 3$, $p = 31$, we have $a'_i = 1, 2, 4, 8, 15$, so that $\nu_3 = 3$, $30/6 = 5$.

For $k = 2$, there are many expressions for the quartic character of 2, beginning with Gauss in terms of $p = a^2 + 4b^2$, Barrucand and Cohn [1] in terms of $c^2 + 8d^2$ and the class number $h = h(\sqrt{-4p})$, and Hasse [4] in terms of $h' = h(\sqrt{-8p})$. Since 2 must be a quadratic residue, $p \equiv 1 \pmod 8$; letting $n = 2n'$ in Theorem 5, we have

$$(-1)^{\nu_2} = (2/p)_4 = (-1)^{b/2} = (-1)^{d + n'} = (-1)^{n' + h/4} = (-1)^{h'/4}.$$

For example, for $p = 73$, there are $\nu_2 = 10$ quadratic residues less than $p/4$, $n' = 9$, $b = 4$, $d = 3$, $h = 4$, $h' = 16$, all testifying to the fact that 2 is a quartic residue of 73.

Going one step further, we find that for $k = 4$, the quartic residues of 73 less than $p/4$ are 1, 2, 4, 8, 9, 16, 18, so that $\nu_4 = 7$, $n = 9$, and hence 2 is an octic residue of 73 by Theorem 5. It is also well known that $(2/p)_8 = (-1)^{n + b/4}$ so that in general $\nu_4 \equiv b/4 \pmod 2$.
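The counts $\nu_k$ used with Theorem 5 can be recomputed by brute force. The sketch below (function names are ours) lists the $k$th power residues below $p/4$ and compares $(-1)^{n + \nu_k}$ with $2^{(p-1)/2k} \bmod p$; the extra case $p = 17$, $k = 2$ is our own addition, chosen because both sides are then $-1$. Run as written, it lists 1, 2, 4, 8, 9, 16, 18 as the quartic residues of 73 below $p/4$.

```python
def residues_below_quarter(p, k):
    """Sorted kth power residues of the prime p that are less than p/4."""
    residues = {pow(x, k, p) for x in range(1, p)}
    return sorted(a for a in residues if 4 * a < p)


def theorem5_sides(p, k):
    """Return ((-1)^(n + nu_k), 2^((p-1)/(2k)) mod p written as +1 or -1)."""
    n = (p - 1) // (2 * k)
    nu_k = len(residues_below_quarter(p, k))
    left = (-1) ** (n + nu_k)
    r = pow(2, n, p)
    right = -1 if r == p - 1 else r
    return left, right


# In each case 2 is a kth power residue of p, as Theorem 5 requires.
for p, k in [(31, 3), (73, 2), (73, 4), (17, 2)]:
    print(p, k, residues_below_quarter(p, k), theorem5_sides(p, k))
```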

Another application of Theorem 3 is as follows: 


Theorem 6 The multiplication of the elements of any coset $S$ of $k$th power residues of a prime $p = 2nk + 1$ by a $k$th power residue $\lambda$ produces an even permutation if and only if

$$(\lambda/p) = 1 \ \text{ if $k$ is odd,} \qquad (\lambda/p)_{2k} = 1 \ \text{ if $k$ is even.}$$



GENERALIZATIONS OF GAUSS’S LEMMA 




Proof Since $p = 2kn + 1$, the set $S$ is symmetric modulo $p$. Let $S_1$ contain the elements $a_i < p/2$ and $S'_1$ contain the elements $p - a_i$ for $i = 1, 2, \ldots, n$. By Theorem 3, the set $S_\lambda$ obtained by multiplying the $a_i \in S_1$ by $\lambda$ will contain $\mu_\lambda$ elements that exceed $p/2$, where

$$(-1)^{\mu_\lambda} = \begin{cases} (\lambda/p) & \text{if $k$ is odd,} \\ (\lambda/p)_{2k} & \text{if $k$ is even.} \end{cases}$$

If these $\mu_\lambda$ elements of $S_\lambda$ are interchanged with their complements in the set $S'_\lambda$, obtained by multiplying the set $S'_1$ by $\lambda \pmod p$, the resulting set will consist of $n$ elements $< p/2$, followed by their complements modulo $p$. If each half is now subjected to the same interchanges, the parity of the permutation will not change so that the permutation is of the same parity as $\mu_\lambda$, which proves the theorem.


Corollary The multiplication by $\lambda$ of the integers $1, 2, \ldots, p - 1$ produces an even or odd permutation according as $\lambda$ is a quadratic residue or not.

This is Zolotareff's theorem [8] which follows from Theorem 6 with $k = 1$, or directly from Theorem 1.
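Zolotareff's theorem lends itself to a direct check. The following sketch (all names are ours) computes the sign of the permutation $x \mapsto \lambda x \pmod p$ from its cycle lengths and compares it with the Legendre symbol for a few small primes.

```python
def permutation_sign(perm):
    """Sign of a permutation given as a dict x -> perm[x], from its cycle lengths."""
    seen, sign = set(), 1
    for start in perm:
        if start in seen:
            continue
        length, x = 0, start
        while x not in seen:
            seen.add(x)
            x = perm[x]
            length += 1
        if length % 2 == 0:      # a cycle of even length is an odd permutation
            sign = -sign
    return sign


def legendre(a, p):
    """Legendre symbol (a/p) for an odd prime p, via Euler's criterion."""
    r = pow(a, (p - 1) // 2, p)
    return -1 if r == p - 1 else r


for p in (7, 11, 13):
    for lam in range(1, p):
        perm = {x: (lam * x) % p for x in range(1, p)}
        assert permutation_sign(perm) == legendre(lam, p)
print("Zolotareff's theorem checked for p = 7, 11, 13")
```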

Theorem 6 is applicable to all primes $p \equiv 1 \pmod k$ if $k$ is odd, but only to those $p \equiv 1 \pmod{2k}$ if $k$ is even. However, in the remaining cases $k$ even and $p \equiv k + 1 \pmod{2k}$ the permutation can be shown to be always even by another method.

Gauss made good use of his lemma in his third proof of the law of quadratic reciprocity. If we use Theorem 3 with $\lambda = \lambda(p) = q$ and then again with $p$ replaced by $q$ and with $\lambda(q) = p$, we obtain the following theorem:

Theorem 7 Let $p = 2nk + 1$, $q = 2km + 1$, and let $(p/q)_k = (q/p)_k = 1$, let $\mu_q(p)$ be the number of elements exceeding $p/2$ among the multiples of the first half of the $k$th power residues of $p$ by $q$, while $\mu_p(q)$ is the corresponding number of multiples by $p$ of the first half of the $k$th power residues of $q$ that exceed $q/2$ modulo $q$, then

$$(-1)^{\mu_q(p) + \mu_p(q)} = \begin{cases} (p/q)(q/p) = (-1)^{(p-1)(q-1)/4} & \text{if $k$ is odd,} \\ (p/q)_{2k}(q/p)_{2k} & \text{if $k$ is even.} \end{cases}$$

Corollary If $k$ is odd, then

$$\mu_q(p) \equiv \begin{cases} 1 + \mu_p(q) \pmod 2 & \text{if } p \equiv q \equiv -1 \pmod 4, \\ \mu_p(q) \pmod 2 & \text{otherwise.} \end{cases}$$

When $k = 1$, this is the reciprocity theorem. When $k = 2$, $p = 4n + 1$ we are dealing with multiples of quadratic residues (or nonresidues) of $p$ and $q$ such that $(p/q) = (q/p) = 1$ and obtain

$$(p/q)_4\,(q/p)_4 = (-1)^{\mu_p(q) + \mu_q(p)}.$$


Other criteria for the quartic reciprocity law are known, such as Burde [2] and Lehmer [6] in terms of $p = a^2 + b^2$, $q = c^2 + d^2$ and of $p = C^2 + qD^2$, when such a representation exists. These are


(p\ M — / _ i V */ ac ~ b d\ _ 11 if q = $n + 1, p = C 2 + qD 2 

W/4\P/4 \ P ) l(’"l) D if q = 8n + 5 


so in this case

$$\mu_q(p) \equiv \mu_p(q) \pmod 2 \quad\text{if}\quad q = 8n + 1,\ p = C^2 + qD^2 \quad\text{or}\quad q = 8n + 5,\ p = C^2 + 4qD^2.$$


Example $k = 2$, $p = 41$, $q = 37$, $a = 5$, $b = 4$, $c = 1$, $d = 6$, $D = 1$, $(p/q)_4(q/p)_4 = -1$. Indeed, the quadratic residues modulo 41 that are $< p/2$ are 1, 2, 4, 5, 8, 9, 10, 16, 18, 20; multiplication by 37 (mod 41) gives 37, 33, 25, 21, 9, 5, 1, 18, 10, 2, of which the first four exceed $p/2$, hence $\mu_{37}(41) = 4$. The quadratic residues of 37 that are $< q/2$ are 1, 3, 4, 7, 9, 10, 11, 12, 16; multiplication by 41 (mod 37) gives 4, 12, 16, 28, 36, 3, 7, 11, 27 (mod 37) with only three numbers exceeding 18, hence $\mu_{41}(37) = 3$ and the result follows.
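The two counts in this example can be reproduced mechanically. In the sketch below, `first_half_residues` and `mu` are hypothetical helper names of ours; `mu(lam, p)` counts how many products $\lambda r \pmod p$, with $r$ a quadratic residue below $p/2$, exceed $p/2$.

```python
def first_half_residues(p):
    """Sorted quadratic residues of the odd prime p that are less than p/2."""
    squares = {(x * x) % p for x in range(1, p)}
    return sorted(r for r in squares if 2 * r < p)


def mu(lam, p):
    """Number of products lam * r (mod p), r a quadratic residue < p/2, that exceed p/2."""
    return sum(1 for r in first_half_residues(p) if 2 * ((lam * r) % p) > p)


p, q = 41, 37
print(first_half_residues(p), [(q * r) % p for r in first_half_residues(p)], mu(q, p))
print(first_half_residues(q), [(p * r) % q for r in first_half_residues(q)], mu(p, q))
print("sign:", (-1) ** (mu(q, p) + mu(p, q)))
```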

In conclusion, we give a generalization to a composite modulus m as 
follows: 

Theorem 8 Let $m > 4$ and let $n = \phi(m)/2$. Let

$$S = \{m_1, m_2, \ldots, m_{2n}\}, \quad m_i < m \text{ and } (m_i, m) = 1$$

be the set of totatives of $m$. Let $P$ be a property enjoyed by half the elements $m_i$ of $S$, but not by $m - m_i$. And let

$$S_1 = \{m'_1, m'_2, \ldots, m'_n\} \quad\text{with } m'_i + m'_j \not\equiv 0 \pmod m$$

and

$$S_\lambda = \{m''_1, m''_2, \ldots, m''_n\} \quad\text{where } m''_i \equiv \lambda m'_i \pmod m$$

be two half-sets of totatives of $m$ having respectively $\mu_1$ and $\mu_\lambda$ elements which do not possess property $P$, then

$$(-1)^{\mu_1 + \mu_\lambda} = \begin{cases} (\lambda/p) & \text{if } m = p^\alpha \text{ or } 2p^\alpha, \\ 1 & \text{otherwise.} \end{cases}$$

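Proof Let $M$ be the product of all the totatives of $m$ having property $P$, and let $M_1$ and $M_\lambda$ respectively be the products of all the elements in $S_1$ and $S_\lambda$. Since for every totative $m_i$ having property $P$ either $m_i$ or $m - m_i$ is an element of $S_1$, and similarly either $m_i$ or $m - m_i$ is an element of $S_\lambda$, it follows that $M_1 \equiv (-1)^{\mu_1} M \pmod m$ and that

$$M_\lambda \equiv (-1)^{\mu_\lambda} M \equiv \lambda^n (-1)^{\mu_1} M \pmod m.$$

Since $(M, m) = 1$, we obtain

$$(-1)^{\mu_1 + \mu_\lambda} \equiv \lambda^n \pmod m.$$

But it is known that for $m > 4$,

$$\lambda^n = \lambda^{\phi(m)/2} \equiv \begin{cases} (\lambda/p) & \text{if } m = p^\alpha \text{ or } 2p^\alpha, \\ 1 & \text{otherwise} \end{cases} \pmod m,$$

so that the theorem follows.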

Theorem 8 reduces to Theorem 3 with $k = 1$ if $m = p$ and gives a generalization of Zolotareff's theorem mentioned in the corollary to Theorem 6 to a composite modulus as follows:

Theorem 9 The multiplication of the totatives of $m > 4$ by a number $\lambda$ prime to $m$ produces an even permutation if $m \ne p^\alpha$ or $2p^\alpha$. In case $m = p^\alpha$ or $2p^\alpha$, the permutation is even or odd according as $(\lambda/p) = 1$ or $-1$.

Proof Let $S_1$ contain the totatives $m_i < m/2$ of $m$ and $S'_1$ those that exceed $m/2$ and let property $P$ be that $m_i < m/2$, then $\mu_1 = 0$ in Theorem 8 and $\mu_\lambda$ is the number of elements in $S_\lambda$ that exceed $m/2$. If these elements are interchanged with the corresponding elements in $S'_\lambda$ obtained by multiplying $S'_1$ by $\lambda$, the resulting half-sets will contain elements $m''_i < m/2$ followed by $m - m''_i > m/2$. The permutation can now be completed by making corresponding interchanges in both sets, so that the parity of the permutation is exactly that of $\mu_\lambda$ and the theorem follows from Theorem 8.

For example, for $m = 18$, $n = 3$, $\lambda = 5$, $S = \{1, 5, 7, 11, 13, 17\}$, $S_1 = \{1, 5, 7\}$, $S'_1 = \{11, 13, 17\}$, $S_5 = \{5, 7, 17\}$, $S'_5 = \{1, 11, 13\}$, and $\mu_1 = 0$, $\mu_5 = 1$, so that there is an odd number of interchanges. In fact, we interchange 1 and 17, then 1 and 5 and 13 and 17, and finally 5 and 7 and 11 and 13, so there are actually five interchanges; and since $m = 2 \cdot 3^2$, we have $p = 3$ and $(\lambda/p) = (5/3) = -1$.
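The statement of Theorem 9 can be tested for small moduli. The sketch below (helper names are ours) computes the sign of the permutation that multiplication by $\lambda$ induces on the totatives of $m$. For $m = 18$, $\lambda = 5$ the permutation is the single cycle $1 \to 5 \to 7 \to 17 \to 13 \to 11 \to 1$, which is odd.

```python
from math import gcd


def totatives(m):
    """Positive integers less than m and prime to m."""
    return [x for x in range(1, m) if gcd(x, m) == 1]


def multiplication_sign(lam, m):
    """Sign of the permutation x -> lam * x (mod m) of the totatives of m."""
    perm = {x: (lam * x) % m for x in totatives(m)}
    seen, sign = set(), 1
    for start in perm:
        if start in seen:
            continue
        length, x = 0, start
        while x not in seen:
            seen.add(x)
            x = perm[x]
            length += 1
        if length % 2 == 0:      # a cycle of even length is an odd permutation
            sign = -sign
    return sign


print("m = 18, lambda = 5:", multiplication_sign(5, 18))
for m in (8, 12, 15, 16, 20, 21):    # none of these is p^a or 2p^a with p an odd prime
    print(m, {multiplication_sign(lam, m) for lam in totatives(m)})
```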

To avoid confusion, it should be noted that Hasse's [5] generalization of Gauss's lemma to composite modulus finds the parity of the number $N$ of solutions $(x, y)$ of the inequality $\lambda x - my < 0$ with $x < m/2$ and prime to $m$ and $y < \lambda/2$ for $\lambda$ a prime. It turns out that $N \equiv \mu_\lambda \pmod 2$ unless $m = 2p^\alpha$, in which case $N$ is always even, while $\mu_\lambda$ is odd in case $(\lambda/p) = -1$ as in the above example. On the other hand, for $\lambda = 5$, $m = 18$ we have the solutions (1, 1), (1, 2), (5, 2), and (7, 2), giving $N = 4$.

Let $p$ be a prime of the form $4k + 3$ and let $s$, $t$ be nonnegative integers. It is shown that any solution to $|\alpha|^2 + |\beta|^2 = p^t$ in the algebraic integers of the cyclotomic field $Q(e^{2\pi i/p^s})$ must have $\alpha$ or $\beta$ equal to zero.


Let $p$ be a prime of the form $4k + 3$. It is well known from elementary number theory that the diophantine equation

$$a^2 + b^2 = p^{2t}$$

has only the trivial solutions with $a$ or $b$ equal to zero. In this paper we extend this result to the algebraic integers in the cyclotomic field $Q(e^{2\pi i/p^s})$ obtained by adjoining a primitive $p^s$th root of unity to the field $Q$ of rationals. We prove

Theorem Let $p$ be a prime of the form $4k + 3$ and let $s$, $t$ be nonnegative integers. If $\alpha$ and $\beta$ are algebraic integers in $Q(e^{2\pi i/p^s})$ that satisfy

$$|\alpha|^2 + |\beta|^2 = p^t, \tag{1}$$

then either $\alpha$ or $\beta$ is zero. More generally, the same conclusion holds if

$$|\alpha|^2 + |\beta|^2 = |\mu|^2, \tag{2}$$

where $\mu$ is an algebraic integer in $Q(e^{2\pi i/p^s})$ that divides some power of $p$.

Proof Let $Z$ denote the ring of rational integers and let $p$ be a prime of the form $4k + 3$. If $a^2 + b^2 \equiv 0 \pmod p$ with $a, b \in Z$, then $a \equiv b \equiv 0 \pmod p$; for $a \not\equiv 0 \pmod p$ would imply that $(b/a)^2 \equiv -1 \pmod p$, which is impossible. The diophantine equation $a^2 + b^2 = p^t$ has no solution if $t$ is odd by a mod 4 argument; and if $t$ is even, an easy induction on $t$ shows that either $a$ or $b$ is zero. This proves the theorem for $s = 0$.
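The case $s = 0$ can be confirmed by exhaustive search for small exponents. The sketch below (the function name is ours) lists all representations $N = a^2 + b^2$ with $0 \le a \le b$. For $p \equiv 3 \pmod 4$ every representation of $p^t$ it finds has $a = 0$, whereas $p = 5$ immediately gives a nontrivial one.

```python
from math import isqrt


def two_square_solutions(N):
    """All pairs (a, b) with 0 <= a <= b and a*a + b*b == N."""
    sols = []
    for a in range(isqrt(N // 2) + 1):   # a <= b forces a*a <= N/2
        b2 = N - a * a
        b = isqrt(b2)
        if b * b == b2:
            sols.append((a, b))
    return sols


for p in (3, 7, 11):
    for t in range(0, 5):
        sols = two_square_solutions(p ** t)
        assert all(a == 0 for a, b in sols)   # only trivial solutions
        print(p, t, sols)
print("p = 5:", two_square_solutions(5), two_square_solutions(25))
```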

Now suppose that $s$ is a positive integer. Let $\zeta = e^{2\pi i/p^s}$. Then $Z[\zeta]$ is the ring of algebraic integers of $Q(\zeta)$ and $Z[\zeta + \bar\zeta]$ is the ring of algebraic integers of $Q(\zeta + \bar\zeta)$, the maximal real subfield of $Q(\zeta)$. Furthermore, $\{1, \zeta\}$ is an integral basis of $Q(\zeta)$ over $Q(\zeta + \bar\zeta)$. Let $\alpha, \beta \in Z[\zeta]$ satisfy (2), where $\mu \in Z[\zeta]$ is a divisor of some positive integral power of $p$. For the moment, assume that $\mu$ is not a unit and that the $t$ occurring in (1) is not zero. Note that any positive integral power of $p$ can be expressed in the form $\mu\bar\mu$ since $p = \prod_j (1 - \zeta^j)$, where $j$ ranges over a reduced residue system modulo $p^s$. Write

$$\alpha = a_1 + a_2\zeta, \qquad \beta = b_1 + b_2\zeta, \tag{3}$$

where $a_1, a_2, b_1, b_2 \in Z[\zeta + \bar\zeta]$. The principal ideal generated by $p$ factors in $Z[\zeta]$ as $(p) = (1 - \zeta)^{\phi}$, where $\phi$ denotes the value of the Euler $\phi$-function at $p^s$. Thus $\mu = \varepsilon(1 - \zeta)^r$ for some positive integer $r$ and a unit $\varepsilon \in Z[\zeta]$. Hence

$$|\mu|^2 = \varepsilon\bar\varepsilon\,\omega^r, \tag{4}$$

where

$$\omega = (1 - \zeta)(1 - \bar\zeta) = 2 - (\zeta + \bar\zeta). \tag{5}$$

Then (2)-(5) yield

$$\alpha\bar\alpha + \beta\bar\beta = a_1^2 + a_2^2 + a_1 a_2(2 - \omega) + b_1^2 + b_2^2 + b_1 b_2(2 - \omega)$$

$$= (a_1 + a_2)^2 + (b_1 + b_2)^2 - (a_1 a_2 + b_1 b_2)\,\omega$$

$$= \varepsilon\bar\varepsilon\,\omega^r.$$

Therefore, in $Z[\zeta + \bar\zeta]$

$$(a_1 + a_2)^2 + (b_1 + b_2)^2 \equiv 0 \pmod{\omega}.$$

The residue class ring $Z[\zeta + \bar\zeta]/(\omega)$ is a finite field whose order is the norm of $\omega$, that is, $p$. As noted previously, the equation $a^2 + b^2 = 0$ has only the trivial solution $a = b = 0$ in the field of order $p \equiv 3 \pmod 4$. Hence $a_1 + a_2 \equiv 0 \pmod{\omega}$, say $a_1 + a_2 = \eta\omega$ with $\eta \in Z[\zeta + \bar\zeta]$. Thus

$$\alpha = a_1 + a_2\zeta = a_1 + (\eta\omega - a_1)\zeta = a_1(1 - \zeta) + \eta\omega\zeta.$$
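Two of the identities used here can be checked in floating point. The sketch below is only a numerical illustration under our own simplifying assumption that $a_1$, $a_2$ are ordinary integers (in the text they range over $Z[\zeta + \bar\zeta]$). It verifies $p = \prod_j (1 - \zeta^j)$ and $|a_1 + a_2\zeta|^2 = (a_1 + a_2)^2 - a_1 a_2\,\omega$ up to rounding error.

```python
import cmath


def check(p, s, a1, a2):
    """Return (prod_j (1 - zeta^j), |a1 + a2*zeta|^2, (a1 + a2)^2 - a1*a2*omega)."""
    q = p ** s
    zeta = cmath.exp(2j * cmath.pi / q)
    # omega = (1 - zeta)(1 - conj(zeta)) = 2 - (zeta + conj(zeta)) is real
    omega = ((1 - zeta) * (1 - zeta.conjugate())).real
    prod = 1
    for j in range(1, q):
        if j % p != 0:               # j runs over a reduced residue system modulo p^s
            prod *= 1 - zeta ** j
    lhs = abs(a1 + a2 * zeta) ** 2
    rhs = (a1 + a2) ** 2 - a1 * a2 * omega
    return prod, lhs, rhs


prod, lhs, rhs = check(7, 1, 3, -5)
print(abs(prod - 7) < 1e-9, abs(lhs - rhs) < 1e-9)
```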

Therefore, $1 - \zeta$ divides $\alpha$. Similarly, $1 - \zeta$ divides $\beta$. Write $\alpha = \alpha_1(1 - \zeta)$, $\beta = \beta_1(1 - \zeta)$, where $\alpha_1$, $\beta_1$ belong to $Z[\zeta]$ and satisfy

$$|\alpha_1|^2 + |\beta_1|^2 = \varepsilon\bar\varepsilon\,\omega^{r-1}.$$

Proceed by induction on $r$ to obtain

$$|\alpha_r|^2 + |\beta_r|^2 = \varepsilon\bar\varepsilon. \tag{6}$$

Let $\alpha_0 = \alpha_r/\varepsilon$, $\beta_0 = \beta_r/\varepsilon$ so that

$$|\alpha_0|^2 + |\beta_0|^2 = 1, \tag{7}$$

with $\alpha_0, \beta_0 \in Z[\zeta]$. Observe that (1) and (2) are already of the form (7) and (6), respectively, if $t = 0$ or $\mu$ is a unit. The Galois group of $Q(\zeta)$ over $Q$ is abelian, so in particular complex conjugation commutes with algebraic conjugation. Therefore (7) implies that all conjugates of $\alpha_0$ do not exceed one in absolute value. Hence by a theorem of Kronecker, $\alpha_0$ is zero or a root of unity; and similarly for $\beta_0$. Whence by (7), either $\alpha_0$ or $\beta_0$ is zero. Thus $\alpha$ or $\beta$ must be zero. This completes the proof.

The theorem is of course false for primes of the form $4k + 1$, for such primes can be expressed as the sum of two squares in $Z$. Suppose $p = 2$. It is not difficult to show that

$$|\alpha|^2 + |\beta|^2 = 2^t \tag{8}$$

has only trivial integral solutions in $Z(e^{2\pi i/2^s})$ with $\alpha$ or $\beta$ zero if and only if $t$ is even and $s = 0$ or 1. Indeed, (8) has a solution with $\alpha = \beta$ in $Z$ or $Z[i]$ according as $t$ is odd or even.

1. Introduction

In his dissertation, E. Artin proved [1] that there are only finitely many 
quadratic extensions E/k(x) for which the ideal class group of the integral 
closure of k[x] has exponent 2 if k is a prime field of odd characteristic and 
the infinite prime does not split. The inequalities from which Artin drew the 
conclusion about the finiteness also give upper bounds on the genera of such 
fields. D. Madden [4] gave a substantial improvement of these upper 
bounds. He also included the case of characteristic 2 and allowed k to be any 
finite field. In Section 3 we give a further improvement of Artin’s theorem. 
The null class group has exponent 2 for such fields 2. Using the class number 
formula for the ambiguous classes and counting the number of integral 
divisors in a divisor class of sufficiently large degree, an application of the 
Riemann hypothesis gives a bound on the genus. If there is no ramified
prime of degree one, these bounds are better than those obtained by
Madden.

In Section 2 we evaluate the Galois cohomology groups for the null class group and the divisor class group associated with a cyclic extension $E/F$ of prime degree $l$, $F$ being an algebraic function field over a finite field of constants. The formula giving the number of ambiguous classes plays a key role in this evaluation. Knowing the ramification, one can calculate the ambiguous class number from this formula, except in the case when the $l$th roots of unity are contained in the field of constants and the degrees of all the ramified primes are divisible by $l$. In this case the least positive degree of an invariant divisor class is 1 or $l$. Both the possibilities are realized. This answers affirmatively a question of Rosen [5].


2. Galois Cohomology 

Let $E/k$ be a cyclic extension of prime degree $l$ of an algebraic function field $F/k$ having the finite field $k$ as its exact field of constants. We use the following notation:

$D_E$ ($D_{0E}$)    the group of divisors (of degree 0) of $E$

$C_E$ ($C_{0E}$)    the group of divisor classes (of degree 0) of $E$

$I_E$ ($I_{0E}$)    the group of ideles (of degree 0) of $E$

$J_E$ ($J_{0E}$)    the group of idele classes (of degree 0) of $E$

$P_E$    the group of principal divisors of $E$

$t$    the number of ramified primes

$\delta(E/F)$    the gcd of $l$ and the degrees of the ramified primes

$c$    the minimal positive degree of an invariant divisor class

$G = \mathrm{gal}(E/F)$

$h^i(G, M)$    the order of the cohomology group $H^i(G, M)$.

We use the same symbol to denote a field and its multiplicative group. 
Finally, for a G-module M, M G will denote the submodule of invariant 
elements. 

We evaluate the cohomology groups for the various G-modules that are 
associated with the extension E/F. In particular, we prove 

Theorem 1 (a) If $E/F$ is unramified,

$$h^0(G, C_{0E}) = h^1(G, C_{0E}) = h^0(G, C_E) = l\,h^1(G, C_E) = \begin{cases} l^2 & \text{if $k$ contains the $l$th roots of unity;} \\ l & \text{if $k$ does not contain the $l$th roots of unity.} \end{cases}$$

(b) If $E/F$ is ramified,
h°(G, C 0 e) = h'{G, C 0E ) = S(E/F)F 

= r 1 

= cF 

h°(G, C E ) == Ih'iG, C E ) = F 
= /* +1 


if k does not contain the 
/th roots of unity; 

if k contains the /th 
roots of unity and 
8(E/F)= 1; 

if k contains the /th 
roots of unity and 
S(E/F) = /; 

if k contains the /th roots of 
unity and S(E/F) = 1; 

otherwise. 


Proof Since $G$ is cyclic, $H^i(G, M) \cong H^{i+2}(G, M)$. Further, if the module
is finite, by Herbrand's lemma, $h^i(G, M) = h^{i+1}(G, M)$. Thus,

$$h^1(G, C_{0E}) = h^0(G, C_{0E}) = [C_{0E}^G : \mathrm{Norm}(C_{0E})] = [C_{0E}^G : C'_{0F}]\,[C'_{0F} : \mathrm{Norm}(C_{0E})], \tag{1}$$

$C'_{0F}$ denoting the image of $C_{0F}$ under the canonical conorm map $C_{0F} \to C_{0E}$
induced by the inclusion $F \subseteq E$.

We have the ambiguous class number formula [6]

$$h_E^G = [C_{0E}^G : 1] = \frac{h_F \cdot c \cdot l^{t}\,[\bar\eta : \eta]}{a\,[\varepsilon : 1]}, \tag{2}$$

where $h_F = [C_{0F} : 1]$ is the class number of $F$; $a$ the minimal positive degree
in $E$ of a divisor of $F$; $\bar\eta$ the elements in the multiplicative group of $k$ that are
norms from $E$; $\eta$ the elements of $k$ that are norms of units of $F$; and $\varepsilon$ the
kernel of the norm map restricted to the units.

We also know [2]

$$[C'_{0F} : 1] = \begin{cases} h_F/l & \text{if } E/F \text{ is unramified and } k \text{ contains the } l\text{th roots of unity;} \\ h_F & \text{otherwise.} \end{cases} \tag{3}$$

From (2) and (3), we can calculate $[C_{0E}^G : C'_{0F}]$.

To evaluate the second factor on the right-hand side in (1), consider the
canonical exact sequence of $G$-modules

$$1 \to E \to I_E \to J_E \to 1.$$

In cohomology, it gives the exact sequence

$$1 \to E^G = F \to I_E^G \to J_E^G \to H^1(G, E).$$

The group $H^1(G, E)$ is trivial by Hilbert's Theorem 90. Therefore, every
invariant idele class contains an invariant idele. Also, it follows from the
definition that

$$I_E^G = I_F, \qquad I_{0E}^G = I_{0F}.$$

Thus,

$$J_E^G = J_F, \qquad J_{0E}^G = J_{0F}.$$

Consider the canonical map

$$J_{0E} \to C_{0E}.$$

Restricted to $J_{0E}^G$, it gives the epimorphism

$$J_{0E}^G = J_{0F} \to C'_{0F}.$$


Norms correspond to norms. Therefore,

$$h^0(G, J_{0E}) = [J_{0E}^G : \text{norms}] = [C'_{0F} : \mathrm{Norm}(C_{0E})]. \tag{4}$$

Considering the rational integers $\mathbb{Z}$ as a $G$-module under trivial action,
we have the exact sequence of $G$-modules

$$1 \to J_{0E} \to J_E \to \mathbb{Z} \to 0. \tag{5}$$

We are using here F. K. Schmidt's theorem [7], which states that for finite
$k$ the algebraic function field has a divisor of degree one. From (5), we
obtain, in cohomology, the exact sequence

$$H^{-1}(G, \mathbb{Z}) \to H^0(G, J_{0E}) \to H^0(G, J_E) \xrightarrow{\ \mu\ } H^0(G, \mathbb{Z}) \to H^1(G, J_{0E}) \to H^1(G, J_E). \tag{6}$$

From class field theory, $H^1(G, J_E)$ is trivial and $H^0(G, J_E)$ has order $l$.
Also,

$$H^{-1}(G, \mathbb{Z}) \cong H^1(G, \mathbb{Z}) \cong \frac{\text{crossed homomorphisms}}{\text{principal crossed homomorphisms}} = 1,$$

$$H^0(G, \mathbb{Z}) \cong \frac{\text{invariant elements}}{\text{norms}} = \frac{\mathbb{Z}}{l\mathbb{Z}}.$$

Idele classes of $F$ have their degrees multiplied by $l$ when considered as
idele classes of $E$. Therefore, the map $\mu$ in (6) is the zero map. Hence,

$$h^0(G, J_{0E}) = h^0(G, J_E) = h^1(G, J_{0E}) = l. \tag{7}$$
From (1), (4), (7), (2), and (3), we have

$$h^0(G, C_{0E}) = l\,[C_{0E}^G : C'_{0F}] = \begin{cases} l^2\,(h_E^G/h_F) & \text{if } E/F \text{ is unramified and } k \text{ contains the } l\text{th roots of unity;} \\ l\,(h_E^G/h_F) & \text{otherwise.} \end{cases} \tag{8}$$

As mentioned above, the degree of a divisor of $F$ gets multiplied by $l$
when considered as a divisor of $E$. Therefore, by F. K. Schmidt's theorem,
$a = l$ in (2). From (2) and (8), we have

$$h^0(G, C_{0E}) = \begin{cases} c\,l\,\dfrac{[\bar\eta : \eta]}{[\varepsilon : 1]} & \text{if } E/F \text{ is unramified and } k \text{ contains the } l\text{th roots of unity;} \\[2ex] c\,l^{t}\,\dfrac{[\bar\eta : \eta]}{[\varepsilon : 1]} & \text{otherwise.} \end{cases} \tag{9}$$

We separate, now, the ramified and the unramified cases. 

E/F Unramified Let $\bar E/\bar F$ be the extension obtained by making a constant
extension of degree $l$. Let $\Gamma = \mathrm{gal}(\bar E/E) = \mathrm{gal}(\bar F/F)$. As in [2], consider
the exact sequence of $\Gamma$-modules, induced by the inclusion $\bar F \subseteq \bar E$,

$$1 \to N \to C_{0\bar F} \to C'_{0\bar F} \to 1.$$

In cohomology, it gives

$$1 \to N^{\Gamma} \to C_{0\bar F}^{\Gamma} \to (C'_{0\bar F})^{\Gamma} \to H^1(\Gamma, N) \to H^1(\Gamma, C_{0\bar F}).$$

If $k$ contains the $l$th roots of unity,

$$l = [N^{\Gamma} : 1] = [N : 1], \qquad C_{0\bar F}^{\Gamma} = C_{0F}, \qquad h^1(\Gamma, C_{0\bar F}) = 1, \qquad h^1(\Gamma, N) = l.$$

Therefore, $h_F = [(C'_{0\bar F})^{\Gamma} : 1]$. If $k$ does not contain the $l$th roots of unity,
then $N = 1$ and the last equation is still valid.

Clearly, $(C'_{0\bar F})^{\Gamma} \subseteq C_{0E}^G$. Therefore,

$$h_F \mid h_E^G. \tag{10}$$

For an unramified extension, every unit is a norm. If $k$ does not contain
the $l$th roots of unity, this is trivial because, then, every unit is the norm of a unit.
If $k$ contains the $l$th roots of unity, this is a result of class field theory. Thus,
in each case,

$$\frac{[\bar\eta : \eta]}{[\varepsilon : 1]} = 1. \tag{11}$$
From (2), (10), and (11),

$$c = l, \qquad h_E^G = h_F. \tag{12}$$

Substitution in (8) or (9) gives

$$h^0(G, C_{0E}) = h^1(G, C_{0E}) = \begin{cases} l^2 & \text{if } k \text{ contains the } l\text{th roots of unity;} \\ l & \text{otherwise.} \end{cases} \tag{13}$$

To calculate $h^0(G, C_E)$ and $h^1(G, C_E)$, consider the exact sequence of
$G$-modules

$$1 \to C_{0E} \to C_E \to \mathbb{Z} \to 0. \tag{14}$$

In cohomology, (14) gives

$$1 \to C_{0E}^G \to C_E^G \xrightarrow{\ \mu\ } \mathbb{Z} \to H^1(G, C_{0E}) \to H^1(G, C_E) \to 1. \tag{15}$$

By (12), the image under $\mu$ is $l\mathbb{Z}$. Therefore, (13) and (15) imply, if $k$
contains the $l$th roots of unity,

$$h^1(G, C_{0E}) = l^2 = l\,h^1(G, C_E). \tag{16}$$

By Herbrand's lemma,

$$\frac{h^0(G, C_E)}{h^1(G, C_E)} = \frac{h^0(G, \mathbb{Z})}{h^1(G, \mathbb{Z})} \cdot \frac{h^0(G, C_{0E})}{h^1(G, C_{0E})} = \frac{h^0(G, \mathbb{Z})}{h^1(G, \mathbb{Z})} = l.$$

Therefore, (16) gives

$$h^0(G, C_E) = l^2. \tag{17}$$

Similarly,

$$h^1(G, C_E) = 1, \qquad h^0(G, C_E) = l \tag{18}$$

if $k$ does not contain the $l$th roots of unity.

Before we study the ramified case, we evaluate $H^1(G, P_E)$ in general. To
that end, consider the exact sequence

$$1 \to k \to E \to P_E \to 1,$$

obtaining in cohomology

$$H^{-1}(G, E) = 1 \to H^{-1}(G, P_E) \to H^0(G, k) \xrightarrow{\ \lambda\ } H^0(G, E) \to \cdots.$$

Therefore, $h^{-1}(G, P_E)$ = order of the kernel of $\lambda$. If $k$ does not contain the $l$th
roots of unity, $h^0(G, k) = 1$. If $k$ contains the $l$th roots of unity, $h^0(G, k) = l$.
In this case a generator of $k$ is a norm from $E$ iff it is everywhere locally a
norm iff the degrees of the ramified primes are all divisible by $l$. This follows
from Hasse's norm theorem. Therefore,

$$h^{-1}(G, P_E) = h^1(G, P_E) = \begin{cases} l & \text{if } k \text{ contains the } l\text{th roots of unity and the degrees of the ramified primes are all divisible by } l; \\ 1 & \text{otherwise.} \end{cases} \tag{19}$$

E/F Ramified We assert that

$$c = \delta(E/F) = \text{the minimal positive degree of an invariant divisor}, \tag{20}$$

if $k$ does not contain the $l$th roots of unity. For, consider the exact sequence

$$1 \to P_E \to D_E \to C_E \to 1,$$

and the induced cohomology sequence

$$1 \to P_E^G \to D_E^G \to C_E^G \to H^1(G, P_E) \to \cdots. \tag{21}$$

(19) and (21) imply that every invariant class contains an invariant divisor.
Therefore, $c = \delta(E/F)$. Substitution in (9) gives

$$h^0(G, C_{0E}) = \begin{cases} \delta(E/F)\,l^{t} & \text{if } k \text{ does not contain the } l\text{th roots of unity;} \\ l^{t-1} & \text{if } k \text{ contains the } l\text{th roots of unity and } \delta(E/F) = 1; \\ c\,l^{t} & \text{if } k \text{ contains the } l\text{th roots of unity and } \delta(E/F) = l. \end{cases} \tag{22}$$


The orders $h^0(G, C_E)$ and $h^1(G, C_E)$ are calculated, as in the unramified
case, by considering the exact sequence (14). The values are

$$h^0(G, C_E) = l\,h^1(G, C_E) = \begin{cases} l^{t} & \text{if } k \text{ contains the } l\text{th roots of unity and } \delta(E/F) = 1; \\ l^{t+1} & \text{otherwise.} \end{cases} \tag{23}$$

From (13), (16), (17), (22), and (23), we see that the proof of Theorem 1 is
complete.

We observe that it is only the last case in (22) that involves c. Rosen [5] 
asked the question if c = 1 can, at all, occur in this case. An affirmative 
answer is provided by the following. 
Proposition Let characteristic $k \ne 2$ and $p(x)$ be an irreducible monic
polynomial of degree $4n + 2$ in $k[x]$. Then $E = k(x)(\sqrt{p(x)})$ contains an invariant
class of degree one.

Proof Consider the quadratic constant extension $\bar E$. The polynomial
$p(x)$ decomposes as the product of two monic irreducible polynomials over
the extended field of constants. The formula (2) for the ambiguous class
number shows that the class number of $\bar E$ is odd. Since $\mathrm{gal}(\bar E/E)$ operates on
$C_{0\bar E}$ and $C_{0E}$ is the invariant subgroup, it follows that $h_E$ is also odd. Now, the
ambiguous class number formula applied to $E$ shows that it has an invariant
class of degree one.

Remark 1 Some of the arguments used in the proof of Theorem 1 also 
appear in Rosen’s paper [5]. We have repeated them here to make this paper 
self-contained. 

Remark 2 It was proved [2] that $h_F$ divides $h_E$ if $E/F$ is a normal extension.
Rosen [5] gave another proof. We observe that the normality is an
unnecessary restriction. Using some results from the theory of abelian varieties,
one can, in fact, show that the quotient of the zeta functions is a
polynomial with rational integral coefficients. To give an arithmetic proof
for $h_F \mid h_E$, one can assume that there is no field strictly between $F$ and $E$, and
that the canonical map $C_{0F} \to C_{0E}$ has a nontrivial kernel. This implies that
$E/F$ is a pure extension of degree $l$, a prime. The proof [2] for the divisibility
is valid in this case. If $E/F$ is nonnormal, a proof can also be given by
considering a Hilbert class field $H$ of $F$, i.e., a maximal abelian unramified
extension of $F$ that has $k$ as its exact field of constants. Then, $H \cap E = F$
and $EH/E$ is abelian and unramified. A simple argument shows that $k$ is the
exact field of constants of $HE$. Therefore, by the reciprocity law, $C_{0F}$ is a
subgroup of $C_{0E}$.


3. A Theorem of E. Artin 

Let $k$ be a finite field and $k(x)$ be the field of rational functions in one
indeterminate. Following Artin, we call a quadratic extension $E/k(x)$ imaginary
if the infinite prime of $k(x)$ does not split in $E$. Let $R$ be the integral
closure of $k[x]$ in $E$. Then, $R$ is a Dedekind domain with finite class group.
There are only finitely many imaginary quadratic extensions $E/k(x)$ for
which the class group of $R$ has exponent 2. We wish to obtain bounds on the
genus of such fields. We shall assume that the genus $g_E = g > 1$ and, hence
[4], $|k| = q \le 5$.

If $h_x$ is the class number of $R$, one has the relation [7]

$$h_x = f\,h_E, \tag{24}$$
where $f = 1$ or 2 according as the infinite prime ramifies or is inert. In each
case, $C_{0E}$ is a subgroup of the class group of $R$. Therefore, its exponent is also
2. (We leave out of consideration the 5 fields [3] for which $h_E = 1$.) Now, $C_{0E}$
has exponent 2 iff it is equal to its subgroup of ambiguous classes, i.e.,

$$h_E = h_E^G = c \times 2^{t-1} \times \frac{[\bar\eta : \eta]}{[\varepsilon : 1]} = 2^{d}. \tag{25}$$

Let $m$ be any natural number not less than $2g - 1$. Then, by the
Riemann-Roch theorem, the dimension of a class of degree $m$ is $m - g + 1$.
The total number of integral divisors in all the classes of degree $m$ is

$$2^{d}\,(q^{m-g+1} - 1)(q - 1)^{-1}.$$

Also, by the Riemann hypothesis, the constant extension of degree $m$ has, at
least, $q^{m/2}(q^{m/2} - 2g)$ primes of degree one. Considering that a prime of $E$ of
degree dividing $m$ can give, at most, $m$ primes of degree one in the constant
extension, we see that $E$ has, at least, $m^{-1} q^{m/2}(q^{m/2} - 2g)$ integral divisors of
degree $m$. Therefore, the exponent of $C_{0E}$ is larger than 2 if

$$(q - 1)\,q^{m/2}(q^{m/2} - 2g) > m\,2^{d}\,(q^{m-g+1} - 1). \tag{26}$$

We consider the two cases when the characteristic is even and when it is 
odd. 

Case 1 characteristic = 2 In this case, (20) and (25) give

$$h_E = h_E^G = 2^{d} = c\,2^{t-1} = \delta(E/k(x))\,2^{t-1}. \tag{27}$$

The genus formula gives $2g + 2 = D$, $D$ denoting the degree of the different.
We observe that, the ramification being wild, a ramified prime of degree $s$
makes a contribution of, at least, $2s$ to $D$.

Consider the case when $q = 2$ and no prime of degree one ramifies. For
$g = 9$, (27) gives $2^{d} \le 2^{3}$. Substituting $m = 2g - 1$, $d = 3$ in (26), we verify
that the inequality is satisfied. We omit the formal argument showing that
(26) is satisfied for all $g \ge 9$. It is not satisfied for $g = 8$.

When $q = 4$ and no prime of degree one ramifies, one sees, similarly, that
(26) is always satisfied.

If $q = 4$ or 2 and a prime of degree one ramifies, Madden [4] has shown
that $g \le 2$, $g \le 4$, respectively. These bounds are better than those derived
from (26).

Case 2 characteristic different from 2 Consider the case when no
prime of degree one ramifies. By (24) and [1],

$$h_x = 2h_E = \begin{cases} 2^{t} & \text{if } \delta(E/k(x)) = 2; \\ 2^{t-1} & \text{if } \delta(E/k(x)) = 1. \end{cases} \tag{28}$$

Therefore,

$$h_E = 2^{d} = \begin{cases} 2^{t-1} & \text{if } \delta(E/k(x)) = 2; \\ 2^{t-2} & \text{if } \delta(E/k(x)) = 1. \end{cases} \tag{29}$$

In this case, the ramification is tame. A ramified prime of degree $s$ makes a
contribution of, at least, $s$ to $D$. Taking $m = 2g - 1$, one can show that (26) is
satisfied for $q = 3$ and 5 if $g \ge 5$, $g \ge 2$, respectively. For example, if $q = 3$,
$g = 5$, then $D = 12$. This implies $t \le 5$. Therefore, from (29), $d \le 4$, in each
case. Substituting in (26) shows that it is satisfied.
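
The two numerical substitutions into (26) mentioned in Cases 1 and 2 can be rechecked mechanically. The following is a minimal sketch, not part of the original argument; the function name `inequality_26_holds` is an illustrative choice, and the inputs are exactly the values quoted in the text ($q = 2$, $g = 9$, $d = 3$ and $q = 3$, $g = 5$, $d = 4$, each with $m = 2g - 1$).

```python
import math


def inequality_26_holds(q: int, g: int, d: int, m: int) -> bool:
    """Test (q-1) q^(m/2) (q^(m/2) - 2g) > m 2^d (q^(m-g+1) - 1), i.e. inequality (26)."""
    root = math.sqrt(q ** m)  # q^(m/2)
    lhs = (q - 1) * root * (root - 2 * g)
    rhs = m * 2 ** d * (q ** (m - g + 1) - 1)
    return lhs > rhs


# Case 1 of the text: q = 2, g = 9, d = 3, m = 2g - 1.
print(inequality_26_holds(2, 9, 3, 2 * 9 - 1))
# Case 2 of the text: q = 3, g = 5, d = 4, m = 2g - 1.
print(inequality_26_holds(3, 5, 4, 2 * 5 - 1))
```

Floating-point square roots suffice here because both sides differ by a wide margin for these inputs; an exact integer comparison would be preferable near the boundary.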

If a prime of degree one ramifies, Madden’s estimates are better. Combin¬ 
ing our results with those of Madden, we have 

Theorem 2 Let $E/k(x)$ be an imaginary quadratic extension such that the
integral closure of $k[x]$ in $E$ has class group of exponent 2. Then, the null class
group of $E$ also has exponent 2. The various possibilities are

$q = 5$, $g \le 2$;

$q = 4$, $g \le 2$;

$q = 3$, $g \le 4$;

$q = 2$, $g \le 8$.

If no prime of degree one ramifies, $g = 2$ is not possible for $q = 4$ and 5.
For a special countable set of irrational numbers, infinitely many integral multiples are constructed,
the decimal expansions of which begin with very large numbers of the digit 9.

Real irrational numbers have the well-known property that the fractional
parts of their multiples lie dense, and even are uniformly distributed,
between 0 and 1. In a different direction I recently proved [2] the following
result: To every positive integer $n$ there exists a second positive integer
$P = P(n)$ such that, if $\alpha$ is any real irrational number, then there is a positive
integer $p = p(\alpha, n)$ satisfying $1 \le p \le P(n)$ such that every possible sequence
of $n$ digits 0, 1, 2, ..., 9 occurs infinitely often in the decimal expansion of $p\alpha$.

In the present note, I shall establish a result of a somewhat different kind.
Let $f(x)$ be a positive integral valued polynomial of degree $m \ge 1$, and let
$\sigma(f)$ be the decimal fraction obtained by writing the decimal forms of $f(1)$,
$f(2)$, $f(3)$, ... successively after the decimal point. Then, for every sufficiently
large positive integer $N$, among the first $10^{N/m}$ digits after the decimal point of

$$\bigl((10^{1} - 1)(10^{2} - 1) \cdots (10^{N} - 1)\bigr)^{m+1}\,\sigma(f),$$

there are at most $(m + 1)N^{3}$ digits distinct from 9.
Almost forty years ago I studied these numbers [1] and proved that
they are transcendental, but are not Liouville numbers. In the present note I
apply both the notation and the results of this old paper.


1 . 


We begin with an almost trivial lemma. 

Lemma Let

$$\sigma = r_0 - \sum_{\nu=1}^{\infty} r_\nu \cdot 10^{-i_\nu}$$

be a convergent series where the $r_\nu$ are positive rational numbers, and the $i_\nu$ are
strictly increasing positive integers. Denote by $d_N$ a common denominator of $r_0$,
$r_1$, ..., $r_N$, and by $e_N$ and $\varepsilon_N$ positive integers such that

$$d_N \cdot \max(r_0, r_1, \ldots, r_N) < 10^{e_N}, \qquad d_N \cdot \sum_{\nu=N+1}^{\infty} r_\nu 10^{-i_\nu} < 10^{-\varepsilon_N}.$$

If

$$\varepsilon_N > N e_N,$$

then at least $\varepsilon_N - N e_N$ of the first $\varepsilon_N$ digits after the decimal point in the
decimal expansion of $d_N \sigma$ are equal to 9.

Proof The positive integer $d_N r_0$ can be written as

$$d_N r_0 = (d_N r_0 - 1) + 0.999\ldots$$

where the decimal fraction has only digits 9. Hence

$$d_N \sigma = (d_N r_0 - 1) + 0.999\ldots - \sum_{\nu=1}^{N} d_N r_\nu \cdot 10^{-i_\nu} - d_N \cdot \sum_{\nu=N+1}^{\infty} r_\nu \cdot 10^{-i_\nu}.$$

Here, by the definition of $e_N$ and $\varepsilon_N$, each of the decimal representations
for the integers $d_N r_1, d_N r_2, \ldots, d_N r_N$ contains at most $e_N$ digits distinct from
0, and the decimal expansion of the convergent series

$$d_N \cdot \sum_{\nu=N+1}^{\infty} r_\nu 10^{-i_\nu}$$

has only zero digits in the first $\varepsilon_N$ places after the decimal point. On account
of the term 0.999... the assertion follows immediately.


2 . 


The numbers $\sigma(f)$ of my paper [1] were defined as follows. Denote by
$f(x)$ a polynomial of the exact degree $m \ge 1$ which for positive integral $x$
assumes only nonnegative integral values and write

$$f_h(x) = \Delta^{h} f(x) = \sum_{H=0}^{h} (-1)^{H} \binom{h}{H} f(x + h - H) \qquad (h = 0, 1, 2, \ldots)$$

for the successive differences of $f(x)$. We need consider these differences only
for $0 \le h \le m$ because those with $h \ge m + 1$ vanish identically. All these
differences have integral values for all positive integers $x$, and they are
moreover positive if $x$ is sufficiently large.
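
As a small illustration of the difference formula just stated, the sketch below evaluates $f_h(x)$ directly from the binomial sum. The sample polynomial $x(x+1)/2$ and the helper names `difference` and `tri` are illustrative assumptions, chosen only to show that the differences of order above the degree vanish.

```python
from math import comb


def difference(f, h, x):
    """h-th forward difference: sum over H of (-1)^H * C(h, H) * f(x + h - H)."""
    return sum((-1) ** H * comb(h, H) * f(x + h - H) for H in range(h + 1))


def tri(x):
    # Illustrative polynomial of degree m = 2 with integer values.
    return x * (x + 1) // 2


for h in range(5):
    print(h, [difference(tri, h, x) for x in range(1, 6)])
```

For this degree-2 example the rows for $h = 3$ and $h = 4$ consist of zeros, in line with the remark that differences with $h \ge m + 1$ vanish identically.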

Without loss of generality, we impose the stronger restriction that $f(x)$ is
strictly increasing and positive for $x \ge 1$ and that moreover

$$f_h(x) > 0 \quad \text{for } 0 \le h \le m \text{ and } x \ge 1.$$

It follows in particular that, for $y \ge f(1) \ge 1$, there exists the inverse function
$x = g(y)$ of $y = f(x)$, and here also $g(y)$ is strictly increasing.

A little more can be said. We can write $f(x)$ in the explicit form

$$f(x) = a^{-m} x^{m} (1 + \alpha_1 x^{-1} + \alpha_2 x^{-2} + \cdots + \alpha_m x^{-m}),$$

where $a$ is a certain positive number and $\alpha_1, \alpha_2, \ldots, \alpha_m$ are certain real
constants. Hence it follows that for sufficiently large $y$,

$$g(y) = a y^{1/m} + O(1). \tag{1}$$


3. 

For every positive integer $k$ the function value $f(k)$ can be written as a
finite decimal

$$f(k) = \sum_{\lambda=0}^{M_k} d_{k\lambda}\,10^{M_k - \lambda} = d_{k0}\,d_{k1} \cdots d_{kM_k},$$

where the coefficients $d_{k\lambda}$ are decimal digits 0, 1, ..., 9, where in particular

$$d_{k0} > 0 \quad \text{for all } k,$$

and where the numbers $M_k$ are nondecreasing nonnegative integers.

We associate now with the polynomial $f(x)$ the infinite decimal fraction

$$\sigma(f) = 0.\,d_{10} d_{11} \cdots d_{1M_1}\; d_{20} d_{21} \cdots d_{2M_2}\; d_{30} d_{31} \cdots d_{3M_3} \cdots.$$

By way of example, the polynomial

$$\varphi(x) = x(x + 1)/2$$

has the required properties, and with it is associated the decimal fraction

$$\sigma(\varphi) = 0.1\ 3\ 6\ 10\ 15\ 21\ 28\ 36\ 45\ 55\ 66\ 78\ 91\ 105\ 120\ 136 \ldots.$$
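
The construction of the decimal fraction can be sketched in a few lines: concatenate the decimal forms of $f(1), f(2), \ldots$ and keep as many digits as wanted. The helper name `sigma_digits` is an illustrative assumption; the polynomial is the triangular-number example from the text.

```python
from itertools import count


def sigma_digits(f, n_digits):
    """First n_digits digits after the decimal point of 0.f(1)f(2)f(3)..."""
    pieces, total = [], 0
    for k in count(1):
        s = str(f(k))
        pieces.append(s)
        total += len(s)
        if total >= n_digits:
            break
    return "".join(pieces)[:n_digits]


def phi(x):
    return x * (x + 1) // 2


print("0." + sigma_digits(phi, 30))
```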
From my old paper [1] I take a strongly convergent series for $\sigma(f)$.
Denote by $n$ the positive integer for which

$$10^{n-1} \le f(1) \le 10^{n} - 1,$$

and put

$$j_{n-1} = 0 \quad \text{and} \quad j_\nu = [g(10^{\nu} - 1)] \quad \text{for } \nu = n, n + 1, n + 2, \ldots;$$

further write

$$J_\nu = \sum_{\mu=n}^{\nu-1} \mu\,(j_\mu - j_{\mu-1}) = (\nu - 1)\,j_{\nu-1} - \sum_{\mu=n}^{\nu-2} j_\mu$$

for $\nu = n + 1, n + 2, n + 3, \ldots$.

With this notation

$$\sigma(f) = \sum_{k=1}^{j_n} f(k)\,10^{-nk} + \sum_{\nu=n+1}^{\infty} 10^{-J_\nu + \nu j_{\nu-1}} \sum_{k=j_{\nu-1}+1}^{j_\nu} f(k)\,10^{-\nu k}.$$

Here the finite sums can be summed by means of a formula from difference
calculus, giving the formula

$$\sigma(f) = \sum_{h=0}^{m} f_h(1)\,(10^{n} - 1)^{-(h+1)} - \sum_{\nu=n+1}^{\infty} 10^{-J_\nu} \sum_{h=0}^{m} f_h(j_{\nu-1} + 1)\Bigl((10^{\nu-1} - 1)^{-(h+1)} - (10^{\nu} - 1)^{-(h+1)}\Bigr). \tag{2}$$
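
Formula (2) can be sanity-checked numerically with exact rational arithmetic. The sketch below is only an illustration under stated assumptions: it takes $f(x) = x(x+1)/2$ (so $m = 2$ and $n = 1$), truncates the series after the term $\nu = 3$, and compares the leading digits with the concatenation $0.1\,3\,6\,10\,15\ldots$. The helper names `diff`, `j`, and `partial_sum` are hypothetical.

```python
from fractions import Fraction
from math import comb

M = 2  # degree of the sample polynomial


def phi(x):
    return x * (x + 1) // 2


def diff(h, x):
    """h-th forward difference of phi at x."""
    return sum((-1) ** H * comb(h, H) * phi(x + h - H) for H in range(h + 1))


def j(nu):
    """Largest k with phi(k) <= 10**nu - 1 (0 if there is none)."""
    k = 0
    while phi(k + 1) <= 10 ** nu - 1:
        k += 1
    return k


def partial_sum(n, top):
    """r_0 minus the terms nu = n+1, ..., top of the series (2)."""
    total = sum(Fraction(diff(h, 1), (10 ** n - 1) ** (h + 1)) for h in range(M + 1))
    big_j = 0  # J_n = 0
    for nu in range(n + 1, top + 1):
        # J_nu = J_{nu-1} + (nu-1) * (j_{nu-1} - j_{nu-2})
        big_j += (nu - 1) * (j(nu - 1) - j(nu - 2))
        inner = sum(
            diff(h, j(nu - 1) + 1)
            * (Fraction(1, (10 ** (nu - 1) - 1) ** (h + 1))
               - Fraction(1, (10 ** nu - 1) ** (h + 1)))
            for h in range(M + 1)
        )
        total -= Fraction(1, 10 ** big_j) * inner
    return total


approx = partial_sum(1, 3)
leading = str(approx.numerator * 10 ** 60 // approx.denominator)
expected = "".join(str(phi(k)) for k in range(1, 40))[:60]
print(leading == expected)
```

Because the omitted terms carry the factor $10^{-J_\nu}$ with rapidly growing $J_\nu$, even this short truncation reproduces many leading digits.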


4. 

On putting

$$r_0 = \sum_{h=0}^{m} f_h(1)\,(10^{n} - 1)^{-(h+1)},$$

$$r_{\nu-n} = \sum_{h=0}^{m} f_h(j_{\nu-1} + 1)\Bigl((10^{\nu-1} - 1)^{-(h+1)} - (10^{\nu} - 1)^{-(h+1)}\Bigr) \quad \text{for } \nu \ge n + 1,$$

$$i_\nu = J_{n+\nu} \quad \text{for } \nu \ge 1,$$

the formula (2) can be written as

$$\sigma(f) = r_0 - \sum_{\nu=1}^{\infty} r_\nu\,10^{-i_\nu}, \tag{3}$$

a series of the same form as in the lemma. From their definitions, all the
numbers $r_0, r_1, r_2, \ldots$ are positive and rational. The coefficients $f_h(1)$ and
$f_h(j_{\nu-1} + 1)$ are positive integers. If further $d_N$ denotes the product

$$d_N = \bigl((10^{1} - 1)(10^{2} - 1) \cdots (10^{N} - 1)\bigr)^{m+1}, \tag{4}$$

then $d_N$ is a common denominator of the $N + 1$ rational numbers $r_0, r_1, \ldots,
r_N$, and

$$d_N < \bigl(10^{1+2+\cdots+N}\bigr)^{m+1} = 10^{(m+1)N(N+1)/2}.$$

In order to make use of the lemma, we require upper estimates for the
numbers $e_N$ and $\varepsilon_N$. Such estimates can be derived from the formula (1) for
$g(y)$. It implies that for large $\nu$

$$j_\nu = a \cdot 10^{\nu/m} + O(1) \quad \text{and} \quad J_\nu = a(\nu - 1) \cdot 10^{(\nu-1)/m} + O(10^{(\nu-1)/m}). \tag{5}$$

This implies that there are two positive constants $c$ and $C$ such that for
large $\nu$,

$$c\nu \cdot 10^{\nu/m} \le i_\nu \le C\nu \cdot 10^{\nu/m}. \tag{6}$$

Further, for $h = 0, 1, \ldots, m$, and for large $x$,

$$f_h(x) = O(x^{m}),$$

hence, by (5), for all such values of $h$ and for large $\nu$,

$$f_h(j_{\nu-1} + 1) = O(10^{\nu}).$$

On the other hand, for such $h$ and $\nu$,

$$(10^{\nu-1} - 1)^{-(h+1)} - (10^{\nu} - 1)^{-(h+1)} = O(10^{-\nu}).$$

Therefore all the sums $r_0, r_1, r_2, \ldots$ are bounded positive numbers, say not
larger than $10^{p} - 1$ where $p$ is some positive integer. This means that for
sufficiently large integers $N$ the products $d_N r_0, d_N r_1, d_N r_2, \ldots$ are positive
integers not greater than

$$10^{(m+1)N^{2}} - 1,$$

and therefore, by (6), that

$$0 < \sum_{\nu=N+1}^{\infty} d_N r_\nu \cdot 10^{-i_\nu} < 10^{-10^{N/m}}.$$

The hypothesis of the lemma is thus satisfied with

$$e_N = (m + 1)N^{2} \quad \text{and} \quad \varepsilon_N = 10^{N/m}.$$

Here $\varepsilon_N > N e_N$ for all sufficiently large $N$, and hence the lemma leads to the
following result.
following result. 
Theorem The decimal fraction $\sigma(f)$ belonging to a polynomial $f(x)$ of
degree $m$ has the following property. To every sufficiently large positive integer
$N$ there exists a positive integer $d_N$ of at most $(m + 1)N(N + 1)/2$ decimal
places such that at least $10^{N/m} - (m + 1)N^{3}$ of the first $10^{N/m}$ digits after the
decimal point in the decimal expansion of $d_N \sigma(f)$ are equal to 9.
The following theorem is proved. If $g_1, g_2, \ldots, g_k$ is a sequence of elements from a group (written
additively) such that $\langle g_1, g_2, \ldots, g_k \rangle$ is not cyclic, then either the set of all sums $g_{i_1} + g_{i_2} +
\cdots + g_{i_t}$ ($1 \le i_1 < i_2 < \cdots < i_t \le k$; $t \ge 1$) contains 0, or it contains at least $2k - 1$ elements.


1. Introduction 

Given a sequence of elements $g_1, \ldots, g_k$ (possibly with repetition) in a
group $G$ (written additively), we shall say that the sequence represents $g$ if $g$
occurs as a sum of the form

$$g = g_{i_1} + \cdots + g_{i_t} \qquad (1 \le i_1 < \cdots < i_t \le k;\ t \ge 1).$$

For finite Abelian groups, the problem of determining the length of the 
longest sequence in the group that does not represent 0 has been extensively 
studied [1-3, 6], and, though solved for Abelian p -groups and for a number 
of other cases, remains unsolved in the general case. Much less is known for 
non-Abelian groups. In this paper we shall prove the following result for an 
arbitrary group. 

Theorem If the sequence $g_1, \ldots, g_k$ does not represent 0, and if the group
$\langle g_1, \ldots, g_k \rangle$ generated by the $g_i$ is not cyclic, then the sequence does represent
at least $2k - 1$ elements.

From this theorem we shall obtain the

Corollary If $G$ is a finite, but noncyclic group of order $n$, then every
sequence in $G$ of length $k \ge \tfrac{1}{2}(n + 1)$ represents 0.

The corollary was proved for Abelian groups by R. B. Eggleton and P.
Erdős [4]. In the same paper, Eggleton and Erdős proved that any sequence
of $k$ distinct elements in an Abelian group, which does not represent 0, does
represent at least $2k - 1$ elements.

It is interesting to note that our theorem and corollary give the best
possible results in the case of dihedral groups, and that this is so even if we
take sequences of distinct elements. To see this, let $G$ be the (dihedral) group
of order $n = 2v$ generated by $a$ and $b$, where $2a = 0$, $vb = 0$, and $b + a =
a - b$. For $k \le v$, form the sequence

$$a,\ a + b,\ a + 2b,\ \ldots,\ a + (k - 1)b.$$

It is easy to see that this sequence represents exactly $2k - 1$ elements, namely
$a, a + b, \ldots, a + (k - 1)b$; $b, 2b, \ldots, (k - 1)b$.
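
The dihedral example can be checked by brute force for a particular size. In the sketch below, which is an illustration rather than part of the paper, an element $sa + rb$ is stored as the pair `(s, r)`; the encoding, the function names, and the sample values $v = 7$, $k = 5$ are assumptions made for the demonstration.

```python
from itertools import combinations


def dihedral_add(p, q, v):
    """(s1*a + r1*b) + (s2*a + r2*b) with 2a = 0, vb = 0, b + a = a - b."""
    s1, r1 = p
    s2, r2 = q
    # Moving r1*b past a flips its sign, by the relation b + a = a - b.
    return ((s1 + s2) % 2, ((-r1 if s2 else r1) + r2) % v)


def represented(seq, v):
    """All sums g_{i1} + ... + g_{it} with i1 < ... < it and t >= 1."""
    out = set()
    for t in range(1, len(seq) + 1):
        for idx in combinations(range(len(seq)), t):
            total = (0, 0)
            for i in idx:
                total = dihedral_add(total, seq[i], v)
            out.add(total)
    return out


v, k = 7, 5
seq = [(1, i) for i in range(k)]  # a, a + b, ..., a + (k-1)b
sums = represented(seq, v)
print(len(sums), (0, 0) in sums)
```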

More generally, the theorem gives, for a given $k$, the best possible result
in any noncyclic group $G$ that has an element $x$ of order at least $k$. For if
$y \in G$, but $y \notin \langle x \rangle$, the sequence

$x, \ldots, x, y$ (where $x$ is repeated $k - 1$ times)

does not represent 0 and does represent exactly $2k - 1$ elements.
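
A concrete instance of this construction, offered only as a sketch: in the noncyclic group $\mathbb{Z}_4 \times \mathbb{Z}_2$ (an illustrative choice, not taken from the paper) the element $x = (1, 0)$ has order 4 and $y = (0, 1)$ lies outside $\langle x \rangle$, so one may take $k = 4$.

```python
from itertools import combinations


def represented_abelian(seq, moduli):
    """All nonempty subsequence sums in the product of cyclic groups Z_m, m in moduli."""
    out = set()
    for t in range(1, len(seq) + 1):
        for idx in combinations(range(len(seq)), t):
            out.add(tuple(sum(seq[i][c] for i in idx) % m for c, m in enumerate(moduli)))
    return out


moduli = (4, 2)
x, y, k = (1, 0), (0, 1), 4
seq = [x] * (k - 1) + [y]  # x repeated k - 1 times, then y
sums = represented_abelian(seq, moduli)
print(len(sums), (0, 0) in sums)
```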

2. Notation and Preliminaries 

If $A$ and $B$ are nonempty subsets of a group $G$, take $A + B$ to be the set of
all elements of the form $a + b$ with $a \in A$, $b \in B$. The notation $|X|$ will
signify the number of elements in a finite set $X$. If $a_1, \ldots, a_k$ are nonzero
group elements, the symbol $\Sigma = \Sigma(a_1, \ldots, a_k)$ will always denote the sum set

$$\Sigma = \{0, a_1\} + \{0, a_2\} + \cdots + \{0, a_k\},$$

that is, the set of all elements $\varepsilon_1 a_1 + \cdots + \varepsilon_k a_k$, where $\varepsilon_i = 0$ or 1. Thus
$0 \in \Sigma$. If $\varepsilon_1 a_1 + \cdots + \varepsilon_k a_k = 0$ ($\varepsilon_i = 0$ or 1) has no solution other than
$\varepsilon_1 = \cdots = \varepsilon_k = 0$, we shall say that 0 has only the trivial representation in $\Sigma$.
By a coset of a subgroup $H$ we shall always mean a right coset $H + g$. If $\Sigma$
contains an element of a coset $H + g$, we shall say that $\Sigma$ represents that
coset.
We shall require the following lemmas:

Lemma 1 If 0 has only the trivial representation in $\Sigma = \Sigma(a_1, \ldots, a_k)$,
then $|\Sigma| \ge k + 1$ and equality holds only when $a_1 = a_2 = \cdots = a_k$.

Proof The lemma is easy to verify for $k = 1, 2$. Assume that $k \ge 3$ and
that the lemma is true for sequences of length $k - 1$. Since 0 has only the
trivial representation, $0 \notin a_1 + \Sigma(a_2, \ldots, a_k)$. Therefore $|\Sigma| \ge 1 + |\Sigma(a_2,
\ldots, a_k)|$. The lemma follows, by induction, from this inequality except when
$a_2 = a_3 = \cdots = a_k$ and $a_1 \ne a_2$. But, by symmetry, $|\Sigma| \ge 1 + |\Sigma(a_1, \ldots,
a_{k-1})|$. Hence, in the remaining case, $|\Sigma| \ge k + 2$.
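
Lemma 1 can be spot-checked exhaustively in small cyclic groups. The sketch below is an illustration only: it works in $\mathbb{Z}_n$ (an assumed test group, where the order of summation is immaterial) and uses hypothetical helper names.

```python
from itertools import product


def sigma_set(seq, n):
    """All sums e_1 a_1 + ... + e_k a_k with e_i in {0, 1}, computed in Z_n."""
    sums = {0}
    for a in seq:
        sums |= {(s + a) % n for s in sums}
    return sums


def only_trivial_zero(seq, n):
    """True when no nonempty subsequence of seq sums to 0 in Z_n."""
    for eps in product((0, 1), repeat=len(seq)):
        if any(eps) and sum(e * a for e, a in zip(eps, seq)) % n == 0:
            return False
    return True


def check_lemma1(n, max_k):
    for k in range(1, max_k + 1):
        for seq in product(range(1, n), repeat=k):
            if not only_trivial_zero(seq, n):
                continue
            size = len(sigma_set(seq, n))
            if size < k + 1:
                return False
            if size == k + 1 and len(set(seq)) != 1:
                return False
    return True


print(check_lemma1(7, 4), check_lemma1(8, 4))
```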

Lemma 2 Suppose $\langle a \rangle$ is a cyclic group of order $n$ and $S = \{0, a, 2a, \ldots,
ta\}$, where $0 \le t \le n - 2$. If $0 \ne h \in \langle a \rangle$, then $|S + \{0, h\}| \ge t + 2$.

Proof If $h$ lies in $S$, then $-a + h \in S$, but $-a + h \notin S + h$. If not, then
$h \in S + h$ but $h \notin S$. Thus, in any case, $S \ne S + h$. Since $S + \{0, h\} = S \cup
(S + h)$, we have $|S + \{0, h\}| > |S| = t + 1$.

Lemma 3 Suppose $\langle a \rangle$ is a cyclic group of odd order $n$ and $S = \{0, a, 2a,
\ldots, ta\}$, where $2 \le t \le n - 1$. If $h \in \langle a \rangle$, $h$ is not a generator of $\langle a \rangle$, and
$0 \notin S + h$, then $|S + \{0, h\}| \ge t + 4$.

Proof Since $\langle a \rangle$ has odd order and $h$ is not a generator, $h \ne a, 2a$. Also
$h \ne 0$ since $h$ is in $S + h$ but 0 is not. Thus $h = ra$, where $3 \le r \le n - 1$. Since
$S + h = \{ra, (r + 1)a, \ldots, (r + t)a\}$ and $0 \notin S + h$, we must have $r + t \le
n - 1$. Hence 0, $a$, and $2a$ are not in $S + h$. Since these three elements are in $S$,
it follows that $|S + \{0, h\}| \ge |S + h| + 3 = |S| + 3 = t + 4$.
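
Lemmas 2 and 3 concern arithmetic progressions in a cyclic group and are easy to confirm by enumeration. The following sketch assumes the model $\langle a \rangle = \mathbb{Z}_n$ with $a = 1$; the function names are illustrative.

```python
from math import gcd


def shifted_union_size(n, t, h):
    """|S + {0, h}| for S = {0, 1, ..., t} inside Z_n (taking a = 1)."""
    S = set(range(t + 1))
    return len(S | {(s + h) % n for s in S})


def check_lemma2(n):
    return all(
        shifted_union_size(n, t, h) >= t + 2
        for t in range(0, n - 1)  # 0 <= t <= n - 2
        for h in range(1, n)      # h != 0
    )


def check_lemma3(n):
    for t in range(2, n):         # 2 <= t <= n - 1
        for h in range(n):
            if gcd(h, n) == 1:
                continue          # h generates Z_n, so the lemma does not apply
            if any((s + h) % n == 0 for s in range(t + 1)):
                continue          # 0 lies in S + h, so the lemma does not apply
            if shifted_union_size(n, t, h) < t + 4:
                return False
    return True


print(all(check_lemma2(n) for n in range(2, 20)))
print(all(check_lemma3(n) for n in (9, 15, 21, 25)))
```

Only odd composite $n$ are informative for Lemma 3, since for prime $n$ every nonzero $h$ is a generator.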

Lemma 4 (Kemperman) Suppose $A$ and $B$ are finite subsets of a group
and $0 \in A \cap B$. If $a + b = 0$ ($a \in A$, $b \in B$) has no solution except $a = b = 0$,
then $|A + B| \ge |A| + |B| - 1$.

The proof of this lemma may be found in Kemperman [5]. 
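
Since Lemma 4 is stated for arbitrary groups, a non-Abelian test case is the natural one. The sketch below checks the inequality over all pairs of subsets of the symmetric group $S_3$ containing the identity; the choice of $S_3$, the representation by permutation tuples, and the function names are assumptions made for illustration.

```python
from itertools import combinations, permutations

ELEMENTS = list(permutations(range(3)))  # the six elements of S_3
IDENTITY = (0, 1, 2)


def compose(p, q):
    """Group law of S_3, playing the role of '+' in the lemma."""
    return tuple(p[q[i]] for i in range(3))


def subsets_with_identity():
    others = [p for p in ELEMENTS if p != IDENTITY]
    for r in range(len(others) + 1):
        for extra in combinations(others, r):
            yield (IDENTITY,) + extra


def check_kemperman():
    for A in subsets_with_identity():
        for B in subsets_with_identity():
            trivial_only = all(
                compose(a, b) != IDENTITY or (a == IDENTITY and b == IDENTITY)
                for a in A
                for b in B
            )
            if trivial_only:
                product_set = {compose(a, b) for a in A for b in B}
                if len(product_set) < len(A) + len(B) - 1:
                    return False
    return True


print(check_kemperman())
```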


3. Proof of the Theorem 


Assume that the theorem is false and let $a_1, \ldots, a_k$ be a criminal sequence
of minimal length $k$. Thus $\langle a_1, \ldots, a_k \rangle$ is not cyclic, $|\Sigma(a_1, \ldots, a_k)| < 2k$ and
0 has only the trivial representation in $\Sigma(a_1, \ldots, a_k)$. Let $H = \langle a_1, \ldots, a_t \rangle$,
where $t$ is the largest index such that $\langle a_1, \ldots, a_t \rangle$ is cyclic. We may assume
that our sequence is chosen, from among the criminal sequences of length $k$,
to have maximal value $t$. Let $u$ be the largest integer such that $\langle a_{k-u+1}, \ldots,
a_k \rangle$ is cyclic. Under these assumptions we first show

Lemma 5 Let $\Sigma = \Sigma(a_1, \ldots, a_k)$.

(a) $|\Sigma| = 2k - 1$, $|\Sigma(a_1, \ldots, a_{k-1})| = |\Sigma(a_2, \ldots, a_k)| = 2k - 2$.

(b) $H + \Sigma = \Sigma$.

(c) $H$ is a finite subgroup of odd order.

(d) $\Sigma$ is the union of at least three cosets of $H$.

(e) $u \le t \le k - 4$ and $t \le |H| - 2$.

Proof We must have $a_1 \in \langle a_2, \ldots, a_k \rangle$, for otherwise $a_1 + \Sigma(a_2, \ldots, a_k)$
and $\Sigma(a_2, \ldots, a_k)$ are disjoint sets, and so $|\Sigma| \ge 2|\Sigma(a_2, \ldots, a_k)|$. But this is
impossible since, by Lemma 1, $|\Sigma(a_2, \ldots, a_k)| \ge k$. It follows, in particular,
that $\langle a_2, \ldots, a_k \rangle$ is not cyclic. Thus, by the minimality of $k$, $|\Sigma(a_2, \ldots,
a_k)| \ge 2k - 2$. Since 0 has only the trivial representation in $\Sigma$, $0 \notin a_1 +
\Sigma(a_2, \ldots, a_k)$. This shows that $|\Sigma| \ge 1 + |\Sigma(a_2, \ldots, a_k)|$. Thus equality
must hold in the last two inequalities. Similarly $|\Sigma(a_1, \ldots, a_{k-1})| = 2k - 2$.
This proves (a).

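Since $|\Sigma| = 1 + |\Sigma(a_2, \ldots, a_k)|$ and $0 \notin a_1 + \Sigma(a_2, \ldots, a_k)$, we must
have that $\Sigma = \{0\} \cup (a_1 + \Sigma(a_2, \ldots, a_k))$. Thus if $x \in \Sigma$ and $x \ne 0$, then
$-a_1 + x \in \Sigma(a_2, \ldots, a_k)$. In particular

$$x \in \Sigma,\ x \ne 0 \ \Rightarrow\ -a_1 + x \in \Sigma. \tag{1}$$

Similarly,

$$x \in \Sigma,\ x \ne 0 \ \Rightarrow\ x - a_k \in \Sigma. \tag{2}$$

From (1) it follows that if $x \in \Sigma$ but $x \notin \langle a_1 \rangle$, then $x$, $-a_1 + x$, $-2a_1 +
x$, ... all lie in $\Sigma$. Since $\Sigma$ does contain elements not in $\langle a_1 \rangle$, we see that $a_1$
has finite order. Hence

$$x \in \Sigma,\ x \notin \langle a_1 \rangle \ \Rightarrow\ \langle a_1 \rangle + x \subseteq \Sigma. \tag{3}$$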

We shall show that the restriction "$x \ne 0$" in (1) can be eliminated, or, in
other words, $-a_1 \in \Sigma$. To do this we need to know that $a_1 \ne a_k$. We shall
show that $a_k \notin \langle a_1 \rangle$. Suppose, to the contrary, $a_k \in \langle a_1 \rangle$. Consider the sequence
$a_k, a_1, a_2, \ldots, a_{k-1}$. Clearly 0 has only the trivial representation in
$\Sigma(a_k, a_1, a_2, \ldots, a_{k-1})$. Let $y \in \Sigma(a_1, \ldots, a_{k-1})$. If $y \in \langle a_1 \rangle$, then $a_k$ and $y$
commute, so $a_k + y = y + a_k \in \Sigma$. If $y \notin \langle a_1 \rangle$, then, by (3), $\langle a_1 \rangle + y \subseteq \Sigma$, so
$a_k + y \in \Sigma$. This proves that $\Sigma(a_k, a_1, \ldots, a_{k-1}) \subseteq \Sigma$. Therefore $a_k, a_1, \ldots,
a_{k-1}$ is a criminal sequence of length $k$ and $\langle a_k, a_1, \ldots, a_t \rangle$ is cyclic. But this
contradicts the maximality of $t$. Thus $a_k \notin \langle a_1 \rangle$.

By (1), $-a_1 + a_k \in \Sigma$. Since $a_1 \ne a_k$, we have, by (2), that $(-a_1 + a_k) -
a_k \in \Sigma$. Thus $-a_1 \in \Sigma$. Thus (1) becomes: $x \in \Sigma \Rightarrow -a_1 + x \in \Sigma$. This gives

$$\langle a_1 \rangle + \Sigma = \Sigma. \tag{4}$$

Since the elements $a_1, \ldots, a_t$ commute, we may interchange $a_1$ with any $a_j$
($j = 2, \ldots, t$), so (4) holds with $a_1$ replaced by $a_j$. Thus $\langle a_j \rangle + \Sigma = \Sigma$ for all
$j = 1, \ldots, t$. It follows that

$$H + \Sigma = \Sigma. \tag{5}$$

We see, by (5), that $H$ is finite and that $\Sigma$ is the union of complete cosets of
$H$. Hence $|\Sigma| = |H|m$, where $m$ is the number of cosets of $H$ represented by
$\Sigma$. By (a), $|\Sigma|$ is odd, hence $|H|$ and $m$ are odd. But $m > 1$, so $m \ge 3$. This
proves (b), (c), and (d).

Clearly $-a_k, \ldots, -a_2, -a_1$ is a criminal sequence, so, by the maximality
of $t$, $t \ge u$. By Lemma 1, $|\Sigma(a_1, \ldots, a_t)| \ge t + 1$. Thus $t + 1 \le |H|$. If
$t = |H| - 1$, then, by Lemma 1, $\Sigma(a_1, \ldots, a_t) = H$. But this means that
$H + \Sigma(a_1, \ldots, a_j) = \Sigma(a_1, \ldots, a_j)$, for all $t \le j \le k$. Hence $|H|$ divides $|\Sigma(a_1,
\ldots, a_j)|$ for all $t \le j \le k$. But this is impossible since $t < k$, $|\Sigma(a_1, \ldots,
a_{k-1})| = 2k - 2$, and $|\Sigma| = 2k - 1$. Thus $t \le |H| - 2$.

Since $2k - 1 = |\Sigma| \ge 3|H| \ge 3t + 6$, we have $k \ge t + 4$. This completes
the proof of (e) and the lemma.

Lemma 6 Either (i) there is an index $s$ such that

$$|\Sigma(a_1, \ldots, a_s)| \ge 2s + t, \tag{6}$$

or (ii) $t = 1$ and $a_2 = a_3 = a_4$.

Proof By Lemma 5, $\Sigma$ represents at least three cosets of $H$. Let $v$ be the
smallest index such that $\Sigma(a_1, \ldots, a_v)$ represents three (or more) cosets of $H$.
Thus $v \ge t + 2$. Let $S = \Sigma(a_1, \ldots, a_t)$. If $w \in \Sigma(a_{t+1}, \ldots, a_v)$, then $S + w \subseteq
\Sigma(a_1, \ldots, a_v)$. Thus $\Sigma(a_1, \ldots, a_v)$ contains at least $|S|$ elements in each coset
it represents. Since $\Sigma(a_1, \ldots, a_v)$ represents at least one more coset than $\Sigma(a_1,
\ldots, a_{v-1})$, we have

$$|\Sigma(a_1, \ldots, a_v)| \ge |\Sigma(a_1, \ldots, a_{v-1})| + |S| \ge 2v - 2 + |S|.$$

By Lemma 1, $|S| \ge t + 1$. If $|S| \ge t + 2$, we are done. Also we are done if
$\Sigma(a_1, \ldots, a_v)$ represents four or more cosets of $H$, for then

$$|\Sigma(a_1, \ldots, a_v)| \ge 2v - 2 + 2|S| \ge 2v - 2 + 2(t + 1) \ge 2v + t.$$

By Lemma 1, $|S| = t + 1$ only when $a_1 = a_2 = \cdots = a_t$.

Thus we may assume that $a_1 = a_2 = \cdots = a_t = a$, hence $S = \{0, a, 2a, \ldots,
ta\}$, and that $\Sigma(a_1, \ldots, a_v)$ represents exactly three cosets of $H$.

Let $x = a_{t+1}$. By the meaning of $v$, $\Sigma(a_1, \ldots, a_{v-1}) \subseteq H \cup (H + x)$. Thus
we may write $\Sigma(a_1, \ldots, a_{v-1}) = A \cup (B + x)$, where $S \subseteq A$, $B \subseteq H$. The third
coset represented by $\Sigma(a_1, \ldots, a_v)$, but not by $\Sigma(a_1, \ldots, a_{v-1})$, is either $H + a_v$
or $H + x + a_v$. Thus if $|A| \ge t + 2$ and $|B| \ge t + 2$, we are done and (6)
holds with $s = v$. For notational convenience, we shall label the first $t + 3$
elements of our sequence (recall $k \ge t + 4$ by Lemma 5) as $a, \ldots, a, x, y, z$.
We consider five cases.
Case 1 $v \ge t + 4$ In this case $\Sigma(x, y, z) \subseteq H \cup (H + x)$. Since the three
elements $y$, $z$, and $y + z$ lie in $H \cup (H + x)$, at least one of them lies in $H$.
Label that element $h_1$. Clearly $h_1 \ne 0$. Thus $x + h_1 \in \Sigma(x, y, z)$. Now $x + h_1$
is not in $H$, hence $x + h_1 = h_2 + x$, where $0 \ne h_2 \in H$. Thus $A \supseteq S + \{0, h_1\}$
and $B \supseteq S + \{0, h_2\}$. By Lemma 5, $t \le |H| - 2$, hence Lemma 2 gives $|A|$,
$|B| \ge t + 2$.

Case 2  $v = t + 3$ and $x \ne y$.  Thus $\Sigma(a, \ldots, a, x, y) = A \cup (B + x)$. Since
$x \notin H$, we have $H + x + y \ne H + y$. Thus one of $x + y$ and $y$ lies in $H$ and
the other lies in $H + x$. Since neither is $0$ or $x$, they have the form $h_1$, $h_2 + x$,
where $h_1, h_2 \ne 0$ and $h_1, h_2 \in H$. Thus $A \supseteq S + \{0, h_1\}$ and $B \supseteq S + \{0, h_2\}$.
Again $|A|, |B| \ge t + 2$, by Lemma 2.

Case 3  $v = t + 2$ and $x \ne y$.  The four elements $0$, $x$, $y$, $x + y$ are distinct,
and $\Sigma(a, \ldots, a, x, y)$ represents exactly three cosets of $H$, hence some
two of these four elements lie in the same coset of $H$. Thus $\Sigma(x, y)$ contains
two elements of the form $w$, $h + w$, where $0 \ne h \in H$. By Lemma 2,
$|S + \{0, h\}| \ge t + 2$, hence at least $t + 2$ elements of $\Sigma(a, \ldots, a, x, y)$ lie in
$H + w$. Since the other two cosets each contain at least $t + 1$ elements of
$\Sigma(a, \ldots, a, x, y)$, we have $|\Sigma(a, \ldots, a, x, y)| \ge 3t + 4 = 2v + t$. Thus (6) holds
with $s = v$.

Case 4  $v = t + 3$ and $x = y$.  In this case $x + y = 2x = h$ lies in $H$ and
$h \ne 0$. Since $\langle a, \ldots, a, x \rangle$ is not cyclic, $h$ is not a generator of $H = \langle a \rangle$. Also
$0 \notin S + h$ since $0$ has only the trivial representation in $\Sigma$.

If $t > 1$, then (since $H$ has odd order, by Lemma 5) Lemma 3 applies, and
we have $|S + \{0, h\}| \ge t + 4$. Hence $|\Sigma(a, \ldots, a, x, y, z)| \ge 3t + 6 = 2v + t$,
and so (6) holds with $s = v$.

If $t = 1$, then $A = S + \{0, h\} = \{0, a, a + h, h\}$, and $|A| = 4$, since
$h \ne 0, \pm a$. If $z \notin H$, then $|A \cup (A + z)| \ge 8$, and so $|\Sigma(a, x, y, z)| \ge 10$. If
$z \in H$, then $0 \notin A + z$, since $0$ has only the trivial representation; therefore
$|A + \{0, z\}| \ge |A| + 1 = 5$. Thus $\Sigma(a, x, y, z)$ contains five elements of $H$,
and so $|\Sigma(a, x, y, z)| \ge 9$. Thus in either case (6) holds with $s = v = 4$.

Case 5  $v = t + 2$ and $x = y$.  Thus $\Sigma(a, \ldots, a, x, x) = S \cup (S + x) \cup (S + 2x)$,
where $0$, $x$, and $2x$ are in different cosets of $H$. If $\Sigma(x, x, z)$
represents five or more cosets of $H$, then $|\Sigma(a, \ldots, a, x, x, z)| \ge 5(t + 1) \ge
2(v + 1) + t$. If $\Sigma(x, x, z)$ represents four cosets of $H$ and $t > 1$, then
$|\Sigma(a, \ldots, a, x, x, z)| \ge 4(t + 1) \ge 2(v + 1) + t$. So, in either of these two cases, we
may take $s = v + 1$ in (6).

Assume $t = 1$ and $\Sigma(a, x, x, z)$ represents exactly four cosets of $H$. We
may assume that $z \ne x$, for otherwise statement (ii) of the lemma holds.
Since $z \ne x$, $\Sigma(x, x, z)$ contains at least five elements by Lemma 1. Hence
two elements of $\Sigma(x, x, z)$ lie in the same coset of $H$. Thus $w, h + w \in \Sigma(x, x, z)$,
where $0 \ne h \in H$. Since $a$ has odd order, $S + \{0, h\} =
\{0, a, h, a + h\}$ contains at least three elements. Thus $H + w$ intersects
$\Sigma(a, x, x, z)$ in at least three elements. It follows that $|\Sigma(a, x, x, z)| \ge 3 +
2 + 2 + 2 = 9$, hence (6) holds with $s = v + 1 = 4$.

There remains only the possibility that $\Sigma(a, \ldots, a, x, x, z)$ represents
exactly three cosets of $H$. Thus the three cosets $H + z$, $H + x + z$,
$H + 2x + z$ coincide, possibly in different order, with $H$, $H + x$, $H + 2x$. If
$S + ix + z \ne S + jx$ for all $0 \le i, j \le 2$, then each coset $H + ix$ ($i = 0, 1, 2$)
intersects $\Sigma(a, \ldots, a, x, x, z)$ in at least $|S| + 1 = t + 2$ elements, and so
$|\Sigma(a, \ldots, a, x, x, z)| \ge 3t + 6 = 2(t + 3) + t$, which means that (6) holds
with $s = t + 3 = v + 1$. Assume that $S + ix + z = S + jx$ for some pair $i, j$
($0 \le i, j \le 2$). Then $S = S + (ix + z - jx)$. Then by Lemma 2, $ix + z -
jx = 0$, so $z = (j - i)x$. Since the three elements $z$, $x + z$, and $x + x + z$
cannot be zero, we must have $z = x$ or $z = 2x$. In either case $3x \in \Sigma(x, x, z)$,
$3x = h \in H$, $0 \notin S + h$, and, since $\langle a, x \rangle$ is not cyclic, $h$ is not a generator of
$H$.

If $t > 1$, then, by Lemma 3, $|S + \{0, h\}| \ge t + 4$. Hence $H$ intersects
$\Sigma(a, \ldots, a, x, x, z)$ in at least $t + 4$ elements, and therefore
$|\Sigma(a, \ldots, a, x, x, z)| \ge 3t + 6 = 2(t + 3) + t$. Thus (6) holds with $s = t + 3 = v + 1$.

If $t = 1$, we may assume $z \ne x$ (otherwise (ii) holds), so $z = 2x$. Then by
Lemma 1, $|\Sigma(x, x, 2x)| \ge 5$. But $a \notin \langle x \rangle$, so $a + \Sigma(x, x, 2x)$ and $\Sigma(x, x, 2x)$
are disjoint. Thus $|\Sigma(a, x, x, 2x)| \ge 2|\Sigma(x, x, 2x)| \ge 10$, and again (6)
holds with $s = 4 = v + 1$. This completes the proof of the lemma.

It is now easy to obtain a contradiction from all of this. Suppose first that
statement (i) of Lemma 6 holds. Thus there is an index $s$ for which
$|\Sigma(a_1, \ldots, a_s)| \ge 2s + t$. We may assume, of course, that $s < k$, thus
$\Sigma = \Sigma(a_1, \ldots, a_s) + \Sigma(a_{s+1}, \ldots, a_k)$. Since $0$ has only the trivial representation in $\Sigma$, $0$ has
only the trivial representation as a sum $0 = a + b$, $a \in \Sigma(a_1, \ldots, a_s)$,
$b \in \Sigma(a_{s+1}, \ldots, a_k)$. Thus, by Lemma 4,

$|\Sigma| \ge |\Sigma(a_1, \ldots, a_s)| + |\Sigma(a_{s+1}, \ldots, a_k)| - 1$.

If $\langle a_{s+1}, \ldots, a_k \rangle$ is not cyclic, we have $|\Sigma(a_{s+1}, \ldots, a_k)| \ge 2(k - s)$, hence
$|\Sigma| \ge 2k + t - 1 \ge 2k$, and we are done. If $\langle a_{s+1}, \ldots, a_k \rangle$ is cyclic, then, by
the definition of $u$, $k - s \le u$. Also $|\Sigma(a_{s+1}, \ldots, a_k)| \ge k - s + 1$, by Lemma 1.
Thus $|\Sigma| \ge s + k + t \ge 2k + t - u \ge 2k$ and we are done.

Suppose that statement (ii) of Lemma 6 holds. Thus $t = 1$ and
$a_2 = a_3 = a_4$. Let $r$ be the largest integer such that $\langle a_2, a_3, \ldots, a_{r+1} \rangle = Q$ is
cyclic. Thus $r \ge 3$. Also $r + 1 < k$ since $u = 1$.

We first show that $\Sigma(a_2, \ldots, a_k)$ represents at least three cosets of $Q$. The
elements $0$, $a_1 + a_2$, $-a_1 + a_2$ lie in $\Sigma$ by (b) of Lemma 5. By (a) of Lemma
5, there is only one element in $\Sigma$ that is not in $\Sigma(a_2, \ldots, a_k)$. This element
must be $-a_1$ since $-a_1 \in \Sigma$ by (b) of Lemma 5, and since $-a_1 \notin \Sigma(a_2, \ldots, a_k)$
since $0$ has only the trivial representation in $\Sigma$. Now $0$, $a_1 + a_2$,
$-a_1 + a_2$ are distinct from $-a_1$ since $a_1$ has odd order and $a_1 \notin Q$. Thus $0$,
$a_1 + a_2$, $-a_1 + a_2 \in \Sigma(a_2, \ldots, a_k)$. Therefore $\Sigma(a_2, \ldots, a_k)$ represents the
three cosets $Q$, $Q + a_1 + a_2$, $Q - a_1 + a_2$, which are distinct, again since
$a_1$ has odd order and $a_1 \notin Q$.

Let $v + 1$ be the smallest index such that $\Sigma(a_2, \ldots, a_{v+1})$ represents three
or more cosets of $Q$. Since $a_2 = a_3 = a_4 \in Q$ and $|\Sigma(a_2, a_3, a_4)| \ge 4$, every
coset of $Q$ represented by $\Sigma(a_2, \ldots, a_{v+1})$ intersects $\Sigma(a_2, \ldots, a_{v+1})$ in at
least four elements. Thus $|\Sigma(a_2, \ldots, a_{v+1})| \ge 4 + |\Sigma(a_2, \ldots, a_v)|$. Since
$v > r + 1$, $\langle a_2, \ldots, a_v \rangle$ is not cyclic, hence $|\Sigma(a_2, \ldots, a_v)| \ge 2(v - 1)$, and so
$|\Sigma(a_2, \ldots, a_{v+1})| \ge 2(v + 1)$. By Lemma 5, $|\Sigma(a_2, \ldots, a_k)| = 2k - 2$, so
$v + 1 < k$. Lemma 4 gives

$|\Sigma(a_2, \ldots, a_k)| \ge 2v + 1 + |\Sigma(a_{v+2}, \ldots, a_k)|$.

If $\langle a_{v+2}, \ldots, a_k \rangle$ is not cyclic, then $|\Sigma(a_{v+2}, \ldots, a_k)| \ge 2(k - v - 1)$, and
hence $|\Sigma(a_2, \ldots, a_k)| \ge 2k - 1$, which is a contradiction. If $\langle a_{v+2}, \ldots, a_k \rangle$ is
cyclic, then $k - v - 1 = u = 1$ and so $|\Sigma(a_2, \ldots, a_k)| \ge 2k - 1$, which is
again a contradiction. This completes the proof of the theorem.

To prove the corollary, suppose $a_1, \ldots, a_k$ is a sequence in a noncyclic
group $G$ of order $n$, and $k \ge \tfrac{1}{2}(n + 1)$. If the sequence does not represent $0$,
then $|\Sigma(a_1, \ldots, a_k)| \ge k + 1 > n/2$ by Lemma 1. Hence $\langle a_1, \ldots, a_k \rangle = G$.
But then, by the theorem, $|\Sigma(a_1, \ldots, a_k)| \ge 2k > n$, which is impossible.
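
The corollary can be checked by brute force in a small case. The following is only an illustrative sketch, not part of the proof: it assumes the abelian noncyclic groups $Z_3 \times Z_3$ (order 9) and $Z_2 \times Z_2$ (order 4), and the function names are hypothetical. For $n = 9$ the corollary predicts that every sequence of $k = 5$ elements represents $0$ as a sum of a nonempty subsequence.

```python
from itertools import combinations, combinations_with_replacement, product


def subsequence_sums(seq, moduli):
    """Set of sums of all nonempty subsequences of seq in Z_m1 x Z_m2 x ..."""
    sums = set()
    for size in range(1, len(seq) + 1):
        for sub in combinations(seq, size):
            # add the chosen elements coordinate by coordinate
            total = tuple(sum(coords) % m for coords, m in zip(zip(*sub), moduli))
            sums.add(total)
    return sums


def every_sequence_represents_zero(moduli, k):
    """True if every length-k sequence has a nonempty subsequence summing to 0."""
    elements = list(product(*(range(m) for m in moduli)))
    zero = tuple(0 for _ in moduli)
    # the group is abelian here, so it suffices to run over multisets
    return all(
        zero in subsequence_sums(seq, moduli)
        for seq in combinations_with_replacement(elements, k)
    )


if __name__ == "__main__":
    print(every_sequence_represents_zero((3, 3), 5))
    print(every_sequence_represents_zero((3, 3), 4))
```

In this example the length 5 cannot be lowered: the sequence (1,0), (1,0), (0,1), (0,1) of length 4 in $Z_3 \times Z_3$ does not represent $0$.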
Concerning a Possible “Thue-Siegel-Roth
Theorem” for Algebraic Differential Equations

CHARLES F. OSGOOD

NAVAL RESEARCH LABORATORY 
WASHINGTON, D.C. 


Let $K$ denote a field of characteristic zero. Let $y_1$ denote a formal power series about $z = \infty$ with
coefficients in $K$ which is a solution of a nonzero algebraic differential equation having its
coefficients in $K$. Let ord denote order of vanishing at $z = \infty$. We show that for each $\varepsilon > 0$ there
exist a constant $c(\varepsilon) > 0$ and a nonzero algebraic differential equation $A_\varepsilon$ having coefficients in $K$,
such that for all $r$, $s \ne 0$ in $K[z]$ either (i) $\operatorname{ord}(y_1 - rs^{-1}) < (2 + \varepsilon) \deg s$, (ii) $\deg s < c(\varepsilon)$, or (iii)
$rs^{-1}$ is a solution of $A_\varepsilon$. In a number of cases both $c(\varepsilon)$ and $A_\varepsilon$ are effectively computable. If one
could eliminate case (iii), a very strong statement of approximation would follow, a statement
analogous in many ways to the Thue-Siegel-Roth theorem. Inequality (i) is best possible in the
sense that, as one may easily show, given any formal series $y_1$ with $\operatorname{ord} y_1 = 0$ and any positive
integer $n$ there exist polynomials $r$ and $s$, with $\deg s \le n$, such that $\operatorname{ord}(sy_1 - r) > n \ge \deg s$.
(Consider the rank of the associated system of linear homogeneous equations.)


Introduction 

Kolchin, in [2], suggested the problem of proving an analogue of the
Thue-Siegel-Roth theorem for (at least) all formal power series $y_1$, with
coefficients in a field $K$ of characteristic zero, that satisfy an algebraic
differential equation. He obtained what may be thought of as a “Liouville type” of
bound on the “diophantine approximation” of such formal series by rational
functions. In two recent papers [3, 4] the present author used the methods of
differential equations in order to obtain effective bounds on the “diophantine
approximation” of algebraic functions over fields $K$ of both zero and
positive characteristic. The present results represent a considerable
generalization of the methods used by Osgood [3] and Osgood [4]. We obtain below
an incomplete solution to the Kolchin problem, as we understand it and
have interest in it; i.e., for each power series $y_1$ and every $\varepsilon > 0$, our analogue
of the Thue-Siegel-Roth theorem holds, for all rational functions not
satisfying a certain algebraic differential equation, $P_\varepsilon = 0$. Here $P_\varepsilon \ne 0$ is a
differential polynomial depending upon both the differential equation satisfied by
$y_1$ and upon $\varepsilon$. In many cases $P_\varepsilon$ is effectively computable.

Since our method could be used to construct a number of (potentially)
distinct differential polynomials $P_\varepsilon$, one would hope that future work in this
area might show† (sometimes even effectively) that beyond some point the
only common zeros of all of these differential polynomials $P_\varepsilon$ are the
solutions to our original algebraic differential equation for $y_1$. Then if, say, $y_1$ is
not a singular solution of our original differential equation for it (which we
shall have to assume below anyway) the argument of Kolchin in [2] would
suffice to complete a (noneffective) proof of our analogue of the
Thue-Siegel-Roth theorem. The effective approach used in [3] by the present
author might then even allow one to obtain this latter result effectively, in
certain cases.

Finally, the approach of this paper can be carried out in more general
contexts. In particular, in Theorem II below, we consider what can be shown
over an arbitrary differential field of characteristic zero. We have no interest
here in other valuations than ord. Extensions to other valuations are likely.
(Presumably, with minor changes, the proof would also go through for fields
of positive characteristic. This is of less interest, perhaps, since there exist
algebraic differential equations, such as $y^{(p)} = 0$, which have in their solution
set every formal power series with coefficients in any field of characteristic
$p > 0$.)

1.


Definitions  Let $K$ denote a field of characteristic zero. Let $z$ be
transcendental over $K$. Let $A$ denote a nonzero algebraic differential equation
over $K$, i.e., $A$ is of the form $\alpha(y, y^{(1)}, \ldots, y^{(n)}, z) = 0$ where each
$y^{(j)} = (d/dz)^j y$, $0 \le j < \infty$, $n$ is a nonnegative integer, and $\alpha(y, y^{(1)}, \ldots, y^{(n)}, z)$
is a not identically zero polynomial in $n + 2$ variables with coefficients in $K$.
Let $y_1$ denote a (generalized) formal power series solution of $A$ about $z = \infty$,
i.e.,

$y_1 = \sum_{j=0}^{\infty} a_j z^{-j+m}$

for some integer $m$ and a set of elements $a_j$ in $K$. Suppose that $y_1$ is not a
solution of $\partial\alpha/\partial y^{(n)} = 0$. Let $\operatorname{ord} f$ denote, for any formal power series $f$ about
$z = \infty$, the order of vanishing of $f$ at $z = \infty$.

† (February 1976) As stated, this now appears unlikely to me; however, it would be enough
to show that $y_1$ is not a limit point (in ord) of the set of common zeros of all of these differential
polynomials.

Theorem I  (i) For each $\varepsilon > 0$ there exist a nonzero algebraic differential
equation $A_\varepsilon$ with coefficients in $K$ and a positive constant $c(\varepsilon)$ such that

$\operatorname{ord}(y_1 - rs^{-1}) < (2 + \varepsilon) \deg s$    (1)

for all polynomials $r$ and $s \ne 0$, in $z$, with coefficients in $K$, such that $rs^{-1}$ is not
a solution of $A_\varepsilon$, and $\deg s \ge c(\varepsilon)$.

(ii) If $K$ equals the rational field, then $A_\varepsilon$ is effectively computable, and if
we know, in addition, a lower bound on $\operatorname{ord} y_1$, then $c(\varepsilon)$ is also effectively
computable.

The following theorem strengthens Theorem I in a rather surprising 
manner. 

Theorem I'  Suppose, under the hypotheses of Theorem I, that
additionally $\operatorname{ord} y_1 > 0$, then Theorem I holds (for a potentially different $A_\varepsilon$) with
$c(\varepsilon) = 1$. If $K = Q$, the rational field, and if there exists an effective algorithm
for deciding if $y_1$ satisfies a given algebraic differential equation with
coefficients in $Q$, then our new $A_\varepsilon$ is effectively computable.

(To handle the general case multiply $y_1$ above by $z^{-M}$, where
$-M < \operatorname{ord} y_1$, and apply Theorem I' to $z^{-M} y_1$, where it is being
approximated by $z^{-M} rs^{-1}$.)

Theorem I' holds out the (remote) possibility that the constructions in
the present paper might someday be used to prove an inequality of the form
$\operatorname{ord}(y_1 - rs^{-1}) < 2(\deg s) + \eta(y_1)$, for some $\eta(y_1) > 0$ and with no
exceptions, since one may now choose $\varepsilon > 0$ after a specific $rs^{-1}$ has been chosen.
On the other hand, if $\deg s \ge 1$, and we choose $\varepsilon < (\deg s)^{-1}$, then (1) would be
false infinitely often (see the abstract). The (hopefully not usual) case that $rs^{-1}$
satisfies $A_\varepsilon$ is forced upon us under these circumstances.

Theorems I and I' will follow easily from a more general result: 

Definitions  Let $F$ denote a differential field, of zero characteristic with
derivation $\delta$. (For these definitions, see Kolchin [1].) By a nonzero
differential polynomial $P$ over $F$, we mean an expression of the form $P(y, y^{(1)}, \ldots,
y^{(n)})$ where $y$ is a differential indeterminate (see Kolchin [1] again), $y^{(j)} = \delta^j y$,
$0 \le j < \infty$, the parameter $n$ is a nonnegative integer, and $P(y, y^{(1)}, \ldots, y^{(n)})$ is
a polynomial in $n + 1$ variables over $F$. (The extension of this definition to
more than one differential indeterminate is straightforward.)

By the “denomination” of a differential polynomial $P = P(y, y^{(1)}, \ldots, y^{(n)})$
over $F$ we shall mean the smallest nonnegative integer $d = d(P)$ such that
$y^d P(wy^{-1}, \ldots, (wy^{-1})^{(n)})$ is a differential polynomial in the two differential
indeterminates $w$ and $y$. By the principal differential ideal (in $F[y, y^{(1)}, \ldots,
y^{(j)}, \ldots]$) generated by a differential polynomial $\alpha$ we shall mean the smallest
(algebraic) ideal in $F[y, \ldots, y^{(j)}, \ldots]$ that both contains $\alpha$ and is closed under
differentiation (as we shall refer to the action of the derivation). We denote
this ideal as $(\alpha)$. Similarly, if our ring is $F[y, y^{(1)}, \ldots; X_1, X_1^{(1)}, \ldots, X_n,
X_n^{(1)}, \ldots]$ where $y, X_1, \ldots, X_n$ are distinct differential indeterminates, we
denote the corresponding ideal as $(\alpha)F[X_1, \ldots, X_n]$.

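Theorem II  (i) For each nonzero differential polynomial $\alpha = \alpha(y, \ldots,
y^{(n)})$, with $\partial\alpha/\partial y^{(n)} \ne 0$, and each $\varepsilon > 0$, there exists a positive integer $\varphi$, a
nonzero differential polynomial $P_\varepsilon \in F[y, y^{(1)}, \ldots]$, and a real number $\theta_\varepsilon > 1$
satisfying $d(P_\varepsilon)\theta_\varepsilon^{-1} < 2 + \varepsilon$, such that each

$\left(\frac{\partial\alpha}{\partial y^{(n)}}\right)^{\varphi} \left(\frac{\partial}{\partial y}\right)^{e_0} \left(\frac{\partial}{\partial y^{(1)}}\right)^{e_1} \cdots P_\varepsilon \in (\alpha)$

for all sequences of nonnegative integers $e_0, e_1, \ldots$ with $\sum e_j < \theta_\varepsilon$.

(ii) Further, one may effectively compute $P_\varepsilon$ if $F$ is the field of rational
functions in one variable with rational (number) coefficients.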


2.


We wish to see that Theorem I follows from Theorem II. By Theorem II,
given $\varepsilon > 0$ there exists (or we may effectively compute) a differential
polynomial $P_{\varepsilon/2}$ such that each

$\left(\frac{\partial\alpha}{\partial y^{(n)}}\right)^{\varphi} \left(\frac{\partial}{\partial y}\right)^{e_0} \left(\frac{\partial}{\partial y^{(1)}}\right)^{e_1} \cdots P_{\varepsilon/2} \in (\alpha)$

if $\sum e_j < \theta_{\varepsilon/2}$, where $\theta_{\varepsilon/2}$ is such that $d(P_{\varepsilon/2})(\theta_{\varepsilon/2})^{-1} < 2 + \varepsilon/2$. Then we see
that the (finite) Taylor series expansion of $P_{\varepsilon/2}$ in powers of $(y - y_1)$,
$(y - y_1)^{(1)}, \ldots$ has zero coefficients for all terms of total degree less than $\theta_{\varepsilon/2}$
in $(y - y_1)$, $(y - y_1)^{(1)}, \ldots$. Let $A_\varepsilon$ be $P_{\varepsilon/2} = 0$. If

$P_{\varepsilon/2}((r/s), (r/s)^{(1)}, \ldots) \ne 0$

then, since ord of a nonzero rational function is at most the degree of the
denominator,

$\operatorname{ord} P_{\varepsilon/2}(r/s, (r/s)^{(1)}, \ldots) \le d(P_{\varepsilon/2}) \deg s$.

Suppose $\operatorname{ord}(r/s - y_1) > 0$ and

$\operatorname{ord}(r/s - y_1) > (2 + \varepsilon) \deg s > (d(P_{\varepsilon/2})(\theta_{\varepsilon/2})^{-1} + \tfrac{1}{2}\varepsilon) \deg s$.

Then we would have that

$\theta_{\varepsilon/2} \operatorname{ord}(r/s - y_1) > d(P_{\varepsilon/2}) \deg s + \tfrac{1}{2}\varepsilon(\deg s) \ge (\operatorname{ord} P_{\varepsilon/2}(r/s, (r/s)^{(1)}, \ldots)) + \tfrac{1}{2}\varepsilon(\deg s)$,    (2)

which leads to a contradiction for all $s$ of sufficiently high degree. The bound
on the degree of $s$ depends upon a lower bound on the order of vanishing at
$z = \infty$ of the coefficients of the (finite) Taylor series for $P_{\varepsilon/2}$ in $(y - y_1)$,
$(y - y_1)^{(1)}, \ldots$. If $P_{\varepsilon/2}$ is effectively computable and if we can bound $\operatorname{ord} y_1$
from below, we can place an effective upper bound on the degrees of those
polynomials $s$ for which there is no contradiction. This proves Theorem I,
assuming Theorem II.

We shall next see that Theorem I' also follows from Theorem II. Where
$\alpha = 0$ is the original differential equation for $y_1$, suppose $\partial\alpha/\partial z = 0$. Then,
taking $F = K$ in Theorem II, we have $\partial P_\varepsilon/\partial z = 0$. If, also, $\operatorname{ord} y_1 > 0$,
inequality (2), above, now implies a contradiction regardless of the degree of $s$ if
$\deg s \ge 1$. We shall next show that there exists $\beta \ne 0$ in $(\alpha)$ with $\partial\beta/\partial z = 0$.
[Since $y_1$ is a zero of $\beta$, obviously $\beta \notin K$. One may then proceed in our
present case to take successive partial derivatives of this $\beta$, with respect to
the highest derivative of $y$ appearing in it, finally arriving at a (possibly) new
$\beta = \beta(y, y^{(1)}, \ldots, y^{(m)})$ such that $\beta(y_1) = 0$ but $(\partial\beta/\partial y^{(m)})(y_1) \ne 0$.] Our
procedure below will be effective under the hypotheses of Theorem I'. We
proceed by induction on $N = \deg_z \alpha$. If $N = 0$, there is nothing to prove. Set
$\alpha = \sum_{j=0}^{N} \alpha_j z^j$ where $N \ge 1$, each $\partial\alpha_j/\partial z = 0$, and $\alpha_N \ne 0$. If $\alpha_N \in (\alpha)$ we
have nothing to prove. If $\alpha_N \notin (\alpha)$, then $\alpha\alpha_N^{-1} \notin K$ (otherwise $\alpha_N = \alpha(\alpha_N\alpha^{-1})$
would be in $(\alpha)$). Using the chain rule, we see $d(\alpha\alpha_N^{-1})/dz$ has degree in $z$ at
most $N - 1$ and is nonzero. Thus $\alpha_N^2\, d(\alpha\alpha_N^{-1})/dz \ne 0$ is in $(\alpha)$ and has degree
in $z$ at most $N - 1$. This proves Theorem I'.

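Let $u$, $v$, and $y$ be differential indeterminates. Let $\varepsilon_1$ be chosen,
$0 < \varepsilon_1 < 1$. In the remainder of Section 2 we shall show how one may construct,
for each sufficiently large positive integer $\theta$, a polynomial $Q$ in the $2\theta$ variables
$u, u^{(1)}, \ldots, u^{(\theta-1)}, v, v^{(1)}, \ldots, v^{(\theta-1)}$ with coefficients in $F$, having the following
properties:

(i) The total degree of each monomial of $Q$ in $u, \ldots, u^{(\theta-1)}, v, \ldots, v^{(\theta-1)}$ is
at most $2\theta$.

(ii) There exists a positive integer $\varphi$ such that at $v^{(j)} = (uy)^{(j)}$, $j = 0,
1, \ldots$, each

$\left(\frac{\partial\alpha}{\partial y^{(n)}}\right)^{\varphi} \left(\frac{\partial}{\partial v}\right)^{e_0} \cdots \left(\frac{\partial}{\partial v^{(\theta-1)}}\right)^{e_{\theta-1}} Q \in (\alpha)F[u]$

if

$\sum_{j=0}^{\theta-1} e_j < \theta(1 - \varepsilon_1)$.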

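Lemma I  The number of solutions in nonnegative integers of

$\sum_{j=1}^{\gamma} s_j = \beta$

for positive integers $\gamma$ and $\beta$ is $\binom{\gamma+\beta-1}{\gamma-1}$. The number of solutions of

$\sum_{j=1}^{\gamma} s_j \le \beta$

is $\binom{\gamma+\beta}{\gamma}$.

Proof  By placing $\gamma - 1$ “cuts” in a sequence of $\beta$ marks we define a
unique solution above. This is equivalent to choosing $\gamma - 1$ marks out of
$\gamma + \beta - 1$ marks to be “cuts.” Letting $s_{\gamma+1}$ be

$\beta - \sum_{j=1}^{\gamma} s_j$

we see that $\binom{\gamma+\beta}{\gamma}$ is the number of solutions of

$\sum_{j=1}^{\gamma} s_j \le \beta$.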
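
The two counts in Lemma I are the usual "cuts and marks" binomial coefficients, and they can be confirmed by direct enumeration for small parameters. The sketch below is only an illustration: the names `gamma` and `beta` stand for the number of unknowns and the bound on their sum as read above, and the function name is hypothetical.

```python
from itertools import product
from math import comb


def count_solutions(gamma, beta, exact=True):
    """Count tuples (s_1, ..., s_gamma) of nonnegative integers whose sum
    equals beta (exact=True) or is at most beta (exact=False)."""
    count = 0
    for s in product(range(beta + 1), repeat=gamma):
        total = sum(s)
        if total == beta or (not exact and total < beta):
            count += 1
    return count


if __name__ == "__main__":
    gamma, beta = 3, 4
    # equality case: binomial(gamma + beta - 1, gamma - 1)
    print(count_solutions(gamma, beta), comb(gamma + beta - 1, gamma - 1))
    # inequality case: binomial(gamma + beta, gamma)
    print(count_solutions(gamma, beta, exact=False), comb(gamma + beta, gamma))
```

The enumeration is exponential in `gamma`, so it is meant only as a sanity check on small cases, not as a way of evaluating the counts used later in the construction of $Q$.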

Lemma II  If $\alpha$ has total degree $d \ge 1$ in $y, y^{(1)}, \ldots, y^{(j)}, \ldots, y^{(n)}$, where
$n \ge 0$, then for each positive integer $\theta$ every



may be written modulo $(\alpha)$ as a linear combination over $F$ of monomials in $y,
\ldots, y^{(n)}$, each having total degree $< 4\theta d$.

Proof  If $j = 0, 1, \ldots, n$, there is nothing to prove. Let $\delta_1$ denote the
derivation which agrees with $\delta$ on $F$ and is such that $\delta_1 y^{(j)} = 0$, $j = 0, 1, \ldots$.
Now if $n + 1 \le j \le \theta$ we see that, using the chain rule $j - n$ times,
while not even in $F[y, y^{(1)}, \ldots, y^{(j)}, \ldots]$ is such that


Then each 



r, 6 (a). 


0 < j < 0 - 1, 


may be written modulo $(\alpha)$ as a linear combination over $F$ of monomials in
$y, y^{(1)}, \ldots, y^{(n)}$ having total degree less than $4\theta d$. This proves Lemma II.


Using Lemmas I and II we wish to see that we can construct our
above-mentioned polynomial $Q$. We need to solve a system of homogeneous linear
equations with coefficients in $F$, nontrivially. Therefore, we must show that we
have more variables than equations. Our variables are the coefficients of the
distinct differential monomials in $u, \ldots, u^{(\theta-1)}, v, \ldots, v^{(\theta-1)}$ (potentially)
appearing in $Q$. We have $\binom{2\theta+2\theta}{2\theta} = \binom{4\theta}{2\theta}$ such coefficients by Lemma I. The
number of distinct monomials in $u, \ldots, u^{(\theta-1)}$ which we have upon setting
$v^{(k)} = (uy)^{(k)}$ after $0 \le j < \theta(1 - \varepsilon_1)$ partial differentiations, is
$\binom{2\theta-j+\theta}{\theta} = \binom{3\theta-j}{\theta}$. The number of distinct partial derivatives of order $j \ge 1$ is
less than ( 9 )-') by Lemma I. Thus the total number of monomial coefficients 
which need to be equated to zero is at most 


Tr.'irr) 

Set q> = 20(20 - 1) < 4 0 2 and apply Lemma II. Equating each of these mono¬ 
mial coefficients to zero leads to at most (" + it J 9 *') homogeneous linear equa¬ 
tions with coefficients in F, using Lemma I. 


Lemma III For all 0 larger than some effectively computable integer, 

Clearly Lemma III will suffice to show the existence of $Q$. If the field $F$ is
the field of rational functions over the rational numbers, we may solve the
above system of linear homogeneous equations effectively by putting the
matrix of coefficients in triangular form.† Clearly a lower bound on $\theta$ may be
effectively computed above in terms of $\varepsilon_1$ and bounds on the order of $\alpha$ and
the degrees of $\alpha$ in $y, \ldots, y^{(j)}, \ldots, y^{(n)}$, for all differential fields $F$.


t That is, where (a, ; ) has rank n , each a { J = 0 if i > n or i < j , while a ltl • =(= 0. 
Proof of Lemma III  If $\theta$ is sufficiently large
|(n+l) + 8^j <02 „ +3 


Now 


T(V)M 

will be shown to be less than 

where j l is the least integer such that j x >e 1 8. First 


/30-O-l)\ 

(0+j-l) 

\ 0 ) 

\ 0 J 

(36-(j-l)\ 

(0+j- l) 

\ S ) 

\ 0 J 

(36-U-l)) 

(0+j- l) 

\ 0 ) 

\ ^ J 


1 + 


(30-;+1); ) 

2 W + i-j) \ 
(36 — j + l)jj 


for ; = 1, 2,..., 6. It follows that 




Now 


•"f* e’Te 1 ') - S {(> + j^)(* + j&)) 


< 0 2 " + 4 | 


2 \» 


(1 - j \(2 0 )- 2 )' 
a 


< 0 2n+4 ( l -^) 2 ) 1 

From the expansion of (1 + l) 2 ", we see that 

(2 n 


2 2n > 


Thus 


> (2n + 1) _1 2 2 ". 

< 2 49 < |;;|( 40 + 1 ), 
from which we see,

0 2n+4 (i - &i) 2 r( 2 ^ 2 < 50 2n+5 (i - (iei) 2 ) 9 ^)- 

Lemma III now follows. 


3. 

In what follows $u$, $v$, $y$, and $Y$ are differential indeterminates. Now
Q(Yu 9 ...,{Yuf-'\ Yv 9 ...,(Yvf~ l) ) 


may be rewritten, where e 0 ,..., e 0 ~ t are nonnegative integers, as 


Z ‘?( e o . • 

(e 0 , •••, ee- i) 

... e 9 _, M <fl u , i>,. 

WnW 

j=0 

Notice that at 

v = wy, 

1 

© 

II 

1 

© 

[or v = uy\ 

each 


... e e _, M (9 u , »,.. 

., v {9 ~ *') e (a)F[u] 


since, at v — uy , 

[ w )* Q{Yu ' • ’ (yu)<<> " ” Yv ' -' {Yv)i9 ~ l)) 6 (a)F[yu] 

<= (a)F[u, Y], 

Lemma IV If 

X/i<0(i-4 

j= 0 

each 

I dot \ 9 I / 5 \ /o / 3 

\aywj X \w ) 

x q(e 0 ,, u (9_ 1) , t>,..., i/ 9- °) 6 (oc)F[m], 

at v = wy. 

Proo/ Lemma IV follows if upon applying 

/ gq \ * / !d\l d V °- 1 \ 

\5y (r,) / X \\3i;/ \di/ 0_1) / / 
to


Q(Yu,(Yuf~ l \ Yv, ..., (y^' 1 *), 
and setting v = uy , 

we obtain an element of (a)F[u, Y]. Using the chain rule for partial differen¬ 
tiation, we see that what we have above is 


W ( "7 M U=o \j) d(Yv) (k) J 

x Q(Yu, ( Yuf - 1 ', Yv,(Yvf-"). 

This last quantity belongs to 

(<x)F[u, 7] at v = uy 

since, if 


i7;<0(i-a 

j=0 

for each choice of nonnegative integers kj , 



x e(y M , ..., (Yuf - 1 \ Yv ,..., (y*;) ( *- 1} ) € (a)F[y W ] at v = uy. 
This proves Lemma IV. 


We shall next find $P_\varepsilon$ and show that it has the desired properties, thus
concluding the proof of Theorem II. From Lemma IV we see that each
nonzero


q(e 0 , ..., e„_i, u, ..., u i0 l \v,...,v {0 l) ) 
satisfies the same hypotheses as 

Q(u, ..., u (0 ~ l \ v,...,* 9 ' 1 '). 

Further, each is homogeneous, i.e., every monomial has the same total 
degree (in the derivatives of u and of v). Then, without loss of generality, we 
may assume in what follows that Q is homogeneous. We have either 

Q(Yu,...,(Yuf- i \(Yv),...,(Yvf~ 1) ) 

= Y m Q(u,...,u ie - i \v,...,v ie - 1) ), 

for some nonnegative integer m < 20, or there exists a nonzero 

q(e 0 m <9 ~ l) ,v,v (e ~ u ) 
with $e_1 + \cdots + e_{\theta-1} > 0$. In the latter case replace


by a nonzero 


q(e 0 > *• •) c q —m y v ,.. 


r (9 - 1) ) 


with e, + ••• + e 8 _i > 0 and iterate the above argument. After at most 
2d 2 (i e e) steps (the number of differentiations occurring in our original poly¬ 
nomial Q is less than 2 6 2 (it)) we must arrive at a new polynomial Q such 
that 


Q(Yu,...,(Yuf- l \ Yv, ...yiYvf-") 
= Y m Q{u,...y V, r (9_1) ), 


for some nonnegative integer m < 26. 
Then we have 


= u m Q( 1, 0, ..., 0; (v/u),(v/uf-"). (3) 

Replace v/u by y, set and set 

6(1, 0, ..., 0, y, ..., y<*~ 1J ) = P e (y, ..., y<»- X) ) = . (4) 

We must check that $P_\varepsilon$ satisfies the conditions stated in Theorem II. Now



at v = uy. By the construction of Q the final quantity above is in (a )F[u} if 

1 -£i). 

J=0 


It follows that each 



e&- l 

P e (y,.,.,y«-")e( a). 
if $\sum_{j=0}^{\theta-1} e_j < \theta(1 - \varepsilon_1)$. Notice that by (3) and (4) the denomination of $P_\varepsilon$,
i.e., $d(P_\varepsilon)$, is at most $m \le 2\theta$. Hence, setting $\theta_\varepsilon = \theta(1 - \varepsilon_1) > 1$, we have
(assuming without loss of generality that $0 < \varepsilon < 1$)

$d(P_\varepsilon)\theta_\varepsilon^{-1} \le 2(1 - \varepsilon_1)^{-1} < 2 + 3\varepsilon_1 = 2 + \varepsilon$.

If the field $F$ equals the rational functions over the rational numbers, our
construction was effective. This proves Theorem II.
The minimum discriminant of seventh degree totally real algebraic number fields is computed. It is
20 134 393 and is attained only by the field $Q(\rho)$, where $\rho$ is a root of the monic irreducible
polynomial $x^7 + x^6 - 6x^5 - 5x^4 + 8x^3 + 5x^2 - 2x - 1$.

We determine the minimum discriminant essentially in the same way as 
J. Hunter did for quintic fields [1]. Two modifications and a fast computer 
enable us to handle the large number of polynomials to be considered. The 
modifications consist of a better bound for the polynomial coefficients 
(Theorem 1) and of a new method of Zassenhaus [6] for determining integral 
bases. Most of the underlying theory is developed by Pohst [4]. 

Let $F$ be a totally real algebraic number field of degree $n = 7$. Then
$F = Q(\rho)$ holds, where $\rho$ is a root of the monic irreducible polynomial

$f(x) = x^7 + a_1 x^6 + \cdots + a_6 x + a_7 \in Z[x]$.    (1)

Moreover, $f$ must have only real roots. We assume that we already know
that $d_0 = 20\,134\,393$ is a discriminant of some field $F$. So we shall prove that
there is no smaller discriminant $d_F$. In the course of our considerations it
will also become clear how to obtain $d_0$ for a start.
We must show that there is no polynomial $f$ the root $\rho$ of which generates
a field $F = Q(\rho)$ with a discriminant $d_F < d_0$. The underlying idea is the
following: If such a polynomial $f$ exists, then there is also a polynomial $f$ with
a root $\rho$ and $F = Q(\rho)$, and the coefficients of $f$ satisfy some inequalities
$|a_i| \le S_i$ with suitable bounds $S_i$ ($i = 1, \ldots, 7$); that is, we must consider
only finitely many polynomials, which can be done by a computer. So at first
we must derive bounds $S_i$ which of course will depend on $d_0$.

Theorem I  Let $F$ be a totally real algebraic number field of degree 7 and
discriminant $d_F < d_0$. Then there is a generating element $\rho$, $F = Q(\rho)$, $\rho$ a root
of a polynomial (1) with

$|\operatorname{tr}(\rho)| = |a_1| \le 3$,    (2)

tr(p 2 )<i + ®d F y 16 <i + mdo) m - (3) 

From Theorem I we obtain bounds $S_1$, $S_2$, $S_7$ by simple calculations
involving only the inequality of the arithmetic and geometric means.

a x e {0, 1, 2, 3}, 

— Hi + (ff^o) 1/6 — a*) < a 2 < — i(8 — aj) 9 

$|a_7| \le ((a_1^2 - 2a_2)/7)^{7/2}$, $a_7 \ne 0$.    (4)
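
The bound on $a_7$ in (4) is the arithmetic-geometric mean inequality applied to the squared roots: their sum is $a_1^2 - 2a_2$ and their product is $a_7^2$. As a rough illustration only (the function name is hypothetical, and floating-point arithmetic is assumed to be accurate enough away from exact integer boundaries), the largest admissible $|a_7|$ for given $a_1$, $a_2$ can be sketched as:

```python
from math import floor


def a7_bound(a1, a2):
    """Largest integer |a_7| allowed by the AM-GM bound
    |a_7| <= ((a1^2 - 2*a2) / 7) ** (7/2)."""
    trace_sq = a1 * a1 - 2 * a2  # sum of the squares of the seven roots
    if trace_sq <= 0:
        return 0  # no polynomial with seven real nonzero roots in this case
    return floor((trace_sq / 7) ** 3.5)


if __name__ == "__main__":
    # a1 = 1, a2 = -6 are the coefficients of the polynomial in the abstract
    print(a7_bound(1, -6))
```

For $a_1 = 1$, $a_2 = -6$ the sum of squares is 13 and the bound allows $|a_7| \le 8$, which is consistent with the constant term $-1$ of the polynomial quoted in the abstract.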

Unfortunately Theorem I does not yield results for $S_i$ with $3 \le i \le 6$. We
get those bounds from the fact that $f$ must have only real zeros. Then also $f^{(4)}$
has only real zeros and the discriminant of $f^{(4)}$ is positive:

$\left| a_3 + \tfrac{5}{49} a_1 (2a_1^2 - 7a_2) \right| < \tfrac{10}{441}\sqrt{3}\,(3a_1^2 - 7a_2)^{3/2}$.    (5)

Since it is difficult to compute discriminants of polynomials of degrees $n > 4$,
we transform the property of $f$ into a statement about a quadratic form.

Theorem II  Let $f \in Z[x]$ be given as in (1). The polynomial $f$ has only real
roots if and only if the symmetric matrix $A = (a_{ij}) \in GL(6, Z)$ with entries

3 

a u =7(7 - ifacij - 7 £ (2k + i -j)a i+k aj- k 

k= 1 

(Uj= 1 , —, 6; i>j; a 0 = 1 ; a m = 0 if m > 1 or m < 0) 

( 6 ) 

is positive definite. The value of the discriminant $D_f$ of $f$ is given by

$D_f = 7^{-5} \det A$.    (7)

Theorem II is very important for two reasons. First, we get bounds for
coefficients of $f$ from the fact that the matrix $A$ must be positive definite, i.e.,
all principal minors of $A$ must be positive. Thus the two-row principal minor
of $A$ yields an upper bound for $a_4$. If $a_2 < 0$ holds, we get a lower bound for
$a_4$ by Newton's inequalities



For $a_2 = 0$ ($a_2 > 0$ does not occur), we use the fact that the discriminant of
$f^{(3)}$ must be positive too:

a\ - a x a 3 + ~ > 0- (9) 


From (9) we obtain a lower bound for $a_4$ by determining the smallest zero by
an improved Newton's method. An inequality of Hermite provides bounds
for $a_5$:

(35n 5 — 9 a 1 a A 4- 2 a 2 a 3 ) 2 < f(21a 4 — 12a 1 a 3 4- 5 a\) 

x (25a x a 5 — 20a 2 a 4 4- 9a 3 ). (10) 


Further, the three-row principal minor of $A$ must be positive. This is
equivalent to


42a 6 < 12a 3 — 14a 2 a 4 — 28 a 1 a 5 -f 


2«21 a 31 a 32 


a 31^22 “ a 32 a ll 


a ll a 22 — a 2l 


( 11 ) 


Since the four-row principal minor of $A$ must also be positive, we obtain an
inequality $a_6^3 + r_1 a_6^2 + r_2 a_6 + r_3 > 0$, where the $r_1$, $r_2$, $r_3$ are rational
numbers depending on $a_1, \ldots, a_5, a_7$. This gives us a lower bound for $a_6$ in the
same way as for $a_4$. So at last we have found estimates for all coefficients of
the polynomials $f$ and must consider only finitely many of them.

The second important property of Theorem II hinges on the possibility
of computing the polynomial discriminants by (7). Namely, if we compute $\det A$ by
Cholesky's method, we examine at the same time whether $f$ has only
real roots. Furthermore, we need the values $D_f$ for determining the
discriminants $d_F$ of the fields $F$ generated by some root $\rho$ of $f$. $D_f$ and $d_F$ differ at
most by a square:

$D_f = m^2 d_F$ ($m \in N$),    (12)

and $D_f = d_F$ holds if and only if $1, \rho, \ldots, \rho^6$ form an integral basis of
$F = Q(\rho)$.
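
The remark that a Cholesky-type elimination delivers the determinant and the positive-definiteness test at once can be illustrated as follows. This is a sketch under stated assumptions, not the program described in the text: it uses an $LDL^T$-style elimination over exact rationals (Python's `Fraction`) rather than double-precision arithmetic, the function names are hypothetical, and the matrix entries of Theorem II are not reproduced here.

```python
from fractions import Fraction


def cholesky_pivots(matrix):
    """Pivots d_1, ..., d_n of the LDL^T elimination of a symmetric matrix,
    or None as soon as a pivot is not positive (matrix not positive definite)."""
    n = len(matrix)
    a = [[Fraction(x) for x in row] for row in matrix]
    pivots = []
    for k in range(n):
        d = a[k][k]
        if d <= 0:
            return None
        pivots.append(d)
        # eliminate column k below the diagonal
        for i in range(k + 1, n):
            factor = a[i][k] / d
            for j in range(k, n):
                a[i][j] -= factor * a[k][j]
    return pivots


def determinant_if_positive_definite(matrix):
    """det(matrix) if the matrix is positive definite, otherwise None."""
    pivots = cholesky_pivots(matrix)
    if pivots is None:
        return None
    det = Fraction(1)
    for d in pivots:
        det *= d
    return det


if __name__ == "__main__":
    print(determinant_if_positive_definite([[4, 2, 0], [2, 5, 3], [0, 3, 10]]))
    print(determinant_if_positive_definite([[1, 2], [2, 1]]))
```

A matrix is positive definite exactly when all pivots are positive, and the determinant is their product, so a single pass answers both questions; with the matrix $A$ of Theorem II this would give $D_f$ through (7).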

We give a short description of the corresponding computer program. It
starts with seven DO-loops in which the possible polynomials $f$ are generated
as linear integer arrays $(a_1, \ldots, a_7)$. A subroutine determines the polynomial
discriminants $D_f$. Since many discriminants are greater than $2^{31}$, they do not
fit into one word on an IBM machine. Fortunately, all $D_f$ are smaller than
$10^{14}$; we therefore carry out the calculation of $D_f$ in double precision. Note
that Cholesky's method is numerically stable. The remaining polynomials $f$
have only real zeros and $D_f > 0$. Then another subroutine tests whether $f$ is
irreducible. (The common algorithms are described by Knuth [2].) All
reducible polynomials $f$ are eliminated. Afterward $D_f$ is factored into a
square-free part and squares of prime numbers. Since $D_f$ is a double precision
number, this has to be done in an adequate way by an assembler subroutine.
There remain 3984 polynomials $f$ with $D_f = ab^2$ ($a, b \in \mathbb{N}$, $a$ square-free) and
$a < d_0$.
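
The splitting $D_f = ab^2$ with $a$ square-free can be sketched in a few lines. This is a minimal illustration by trial division, not the assembler subroutine mentioned above; the function name is hypothetical. Python integers are unbounded, so the word-size issue described in the text does not arise here.

```python
def squarefree_decomposition(d):
    """Return (a, b) with d == a * b * b and a square-free, for a positive integer d."""
    if d <= 0:
        raise ValueError("d must be a positive integer")
    a, b = 1, 1
    p = 2
    while p * p <= d:
        exponent = 0
        while d % p == 0:
            d //= p
            exponent += 1
        b *= p ** (exponent // 2)   # each pair of equal prime factors goes into b
        if exponent % 2:
            a *= p                  # an unpaired prime factor stays in a
        p += 1
    a *= d                          # the leftover is 1 or a single prime
    return a, b
```

For instance, `squarefree_decomposition(300)` returns `(3, 10)`, since $300 = 3 \cdot 10^2$.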

The determination of the field discriminants $d_F$ corresponding to the $D_f$
is carried out in three steps. Let $D_f = ab^2$ again, but $a$ not necessarily
square-free. The task is to examine whether $d_F = a < d_0$ holds.

(a) We can disprove $d_F = a$ in many cases in the following way. If
$d_F = a < d_0$ held, there would exist by Theorem I a generation of $F$ in the form
$F = \mathbb{Q}(\tilde\rho)$ with a smaller value of $\mathrm{tr}(\tilde\rho^2)$, i.e., there would exist a
polynomial $\tilde f$ with $\tilde f(\tilde\rho) = 0$, with a greater coefficient $a_2$ and $a \mid D_{\tilde f}$. This $\tilde f$
would also be among the 3984 polynomials. If there is no such $\tilde f$, we can eliminate $f$.

(b) By a method of Nagell [3] we determine the powers $p^m$ ($m \in \mathbb{N}$) of
prime numbers $p \le 7$, $p \mid b$, which also divide $d_F$. Essentially we construct an
integral basis of $F$ with respect to these primes.

(c) For prime numbers $p$ that are greater than the field degree we apply
a strong method of Zassenhaus [6]. In $\mathbb{Z}[x]$ we compute the polynomials

$$D_0 = \gcd(f, f') \bmod p, \qquad C_0 = f/D_0 \bmod p,$$

$$C_1 = \gcd(f, C_0) \bmod p, \qquad D_1 = \gcd(C_0D_0 - f,\, C_1) \bmod p.$$

For $D_1 = 1$ we obtain $p \nmid (D_f/d_F)$, and $D_1 \neq 1$ yields $p^2 \mid (D_f/d_F)$.

By means of these three criteria we finally get 

Theorem III The minimum discriminant of seventh degree totally real
algebraic number fields is

$$d_0 = 20\,134\,393 = 71 \cdot 283\,583.$$

Remark Actually we start our computations without knowledge of $d_0$.
Since only the lower bound for $a_2$ depends on $d_0$, we carry out the program
for $a_1 \in \{0, 1, 2, 3\}$ and the corresponding greatest values for $a_2$ from (4).
Then we proceed replacing $a_2$ by $a_2 - 1$ until we find a polynomial $f$ the
roots of which generate a totally real field $F_1$ of degree 7 and discriminant
$d_1$. The discriminant $d_1$ yields a lower bound for $a_2$ by Theorem I. Every
time we find a field of smaller discriminant we get a better lower bound for
$a_2$. In fact $d_0$ is already obtained after a few steps.

It remains to show that there is, up to isomorphism, exactly one field of
discriminant $d_0$. Namely, we obtain 12 polynomials $f$ with $D_f = d_0$:

$f_1(x) = x^7 + x^6 - 6x^5 - 5x^4 + 8x^3 + 5x^2 - 2x - 1$

$f_2(x) = x^7 + x^6 - 6x^5 - 4x^4 + 10x^3 + 4x^2 - 4x - 1$

$f_3(x) = x^7 + x^6 - 8x^5 - 12x^4 + 6x^3 + 12x^2 - 1$

$f_4(x) = x^7 + x^6 - 10x^5 - 15x^4 + 20x^3 + 45x^2 + 18x - 1$

$f_5(x) = x^7 + 2x^6 - 5x^5 - 8x^4 + 5x^3 + 6x^2 - x - 1$

$f_6(x) = x^7 + 2x^6 - 5x^5 - 8x^4 + 8x^3 + 7x^2 - 3x - 1$

$f_7(x) = x^7 + 2x^6 - 6x^5 - 11x^4 + 8x^3 + 10x^2 - 4x - 1$

$f_8(x) = x^7 + 2x^6 - 7x^5 - 12x^4 + 11x^3 + 10x^2 - 5x - 1$

$f_9(x) = x^7 + 2x^6 - 7x^5 - 13x^4 + 11x^3 + 14x^2 - 8x - 1$

$f_{10}(x) = x^7 + 2x^6 - 8x^5 - 18x^4 + 7x^3 + 23x^2 - x - 7$

$f_{11}(x) = x^7 + 3x^6 - 4x^5 - 11x^4 + 8x^3 + 7x^2 - 6x + 1$

$f_{12}(x) = x^7 + 3x^6 - 5x^5 - 17x^4 - 5x^3 + 9x^2 + 6x + 1.$

Let $x_i$ be a root of $f_i$ ($i = 1, \ldots, 12$). Then $x_5 = x_1^{-1}$ is immediate. To prove
that all 12 fields are isomorphic is rather complicated. Extensive calculations
by a computer show that they are indeed:

$x_2 = x_1^6 + x_1^5 - 6x_1^4 - 5x_1^3 + 8x_1^2 + 4x_1 - 2$

$x_4 = x_1^2 - 2$

$x_5 = x_1^{-1}$

x 7 = x? 4- x 3 — 6x? — 4x 3 4- 8xf 4- Xj — 2 
x 8 = —xf — x 4 4- 5x 3 4- 4 x\ — 4x t — 2 
x 9 = — x\ 4- 6x^ — x 3 — 8x 2 4- 2x 1 4- 2 
x 12 = —x? 4- 6x? — x 3 — 7x 2 4- 2x t 
x 3 = x 7 — 5x 3 4- 2x 7 
x io = — x 7 4- 6x 4 — 7x 2 — x 7 + 1 
Xu = X 7 4- x 4 — 5x 3 — 5x 2 4- 2x 7 4- 2 
X 6 = xf - 6x 8 + x| 4- 4x1 - 3x 8 . 
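
The relations that express $x_2$, $x_4$, and $x_5$ through $x_1$ can be checked independently. The following sketch, which is not the program used by the author, computes the characteristic polynomial of an element of $\mathbb{Q}[x]/(f_1)$ from the traces of its powers (Newton's identities) and compares it with $f_2$, $f_4$, and $f_5$. All function names are hypothetical. The inverse of $x_1$ is entered as the polynomial $x_1^6 + x_1^5 - 6x_1^4 - 5x_1^3 + 8x_1^2 + 5x_1 - 2$, which follows from $f_1(x_1) = 0$.

```python
from fractions import Fraction

# Coefficient lists, leading coefficient first.
F1 = [1, 1, -6, -5, 8, 5, -2, -1]
F2 = [1, 1, -6, -4, 10, 4, -4, -1]
F4 = [1, 1, -10, -15, 20, 45, 18, -1]
F5 = [1, 2, -5, -8, 5, 6, -1, -1]


def reduce_mod(poly, f_low):
    """Remainder of poly modulo the monic f; both lists have the constant term first."""
    n = len(f_low) - 1
    p = list(poly)
    for d in range(len(p) - 1, n - 1, -1):
        c = p[d]
        if c:
            for i in range(n + 1):
                p[d - n + i] -= c * f_low[i]
    p = p[:n]
    return p + [0] * (n - len(p))


def mul_mod(a, b, f_low):
    """Product of two residue classes modulo f."""
    prod = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            prod[i + j] += ai * bj
    return reduce_mod(prod, f_low)


def trace(h, f_low):
    """Trace of multiplication by h on the basis 1, x, ..., x^(n-1)."""
    n = len(f_low) - 1
    x = [0, 1] + [0] * (n - 2)
    xj = [1] + [0] * (n - 1)
    total = 0
    for j in range(n):
        total += mul_mod(h, xj, f_low)[j]
        xj = mul_mod(xj, x, f_low)
    return total


def char_poly(g, f):
    """Characteristic polynomial of g(root of f); g and f have the leading term first."""
    f_low = f[::-1]
    n = len(f) - 1
    g_low = reduce_mod(g[::-1], f_low)
    power = [1] + [0] * (n - 1)
    p = [0]
    for _ in range(n):
        power = mul_mod(power, g_low, f_low)
        p.append(trace(power, f_low))
    c = [Fraction(1)]
    for k in range(1, n + 1):
        acc = p[k] + sum(c[i] * p[k - i] for i in range(1, k))
        c.append(-Fraction(acc) / k)
    return [int(v) for v in c]


X2 = [1, 1, -6, -5, 8, 4, -2]    # x_2 as a polynomial in x_1
X4 = [1, 0, -2]                  # x_4 = x_1^2 - 2
INV = [1, 1, -6, -5, 8, 5, -2]   # x_5 = 1/x_1 as a polynomial in x_1
```

Since $f_2$, $f_4$, and $f_5$ are irreducible, equality of the characteristic polynomial with $f_i$ shows that the given expression in $x_1$ is a root of $f_i$.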

Theorem IV There is, up to isomorphism, exactly one totally real algebraic
number field $F_0$ of degree 7 and discriminant $d_{F_0} = d_0$. $F_0$ is generated
by a root of the polynomial $x^7 + x^6 - 6x^5 - 5x^4 + 8x^3 + 5x^2 - 2x - 1$.

Remark In the meantime I also proved that the Galois group of $F_0$ is the
symmetric group $\mathfrak{S}_7$ and that the class number of $F_0$ is 1. For the proofs and
a system of fundamental units of $F_0$, see Pohst [5].

All computations were carried out on the IBM 370/165 of the
Kernforschungsanlage Jülich.


On Absolutely Irreducible
Representations of Orders †

WILHELM PLESKEN

Absolutely irreducible representation modules $L$ of orders $H$ over Dedekind domains $R$ with
quotient field $K$ are investigated. In particular a set of representatives of the $R$-classes contained in the
$K$-class of $L$ is described and the finiteness of the class number of $L$ is investigated. The lattice of
submodules of $L$ is studied and the case where this lattice is distributive is characterized. This
distributivity is necessary and sufficient for the class number of $L$ to be finite in case the class
number of $R$ is finite and the residue class fields of $R$ are infinite. The behavior of $L$ under finite
extensions of $R$ and some examples of $\mathbb{Z}$-representations of finite groups are discussed.


1. Introduction 

Let $R$ be a Dedekind domain with quotient field $K$ and let $H$ be an
$R$-order. An $H$-representation module is defined as an $H$-module that is
finitely generated and torsion free as an $R$-module. Two $H$-representation
modules are $R$-equivalent if they are ($H$-)isomorphic, and they are
$K$-equivalent if their tensor products with $K$ are $KH$-isomorphic. (We write
$KH$ instead of $K \otimes_R H$.) Under certain conditions imposed on $R$ the
Jordan-Zassenhaus theorem [5, 16] assures that every $K$-equivalence class
of $H$-representation modules splits into finitely many $R$-equivalence classes.
In this paper we shall restrict our attention to absolutely irreducible
$H$-representation modules. In Section 2 we describe a set of representatives of
the $R$-classes lying in a given $K$-class of such a representation module (see
Theorem 2.5). We show that a certain local-global principle is valid (see
Proposition 2.6), which was already established by Maranda [10] in a
slightly different way. Finally, we give two new proofs for the finiteness of the
class number, one for general orders and one for group rings $RG$, where $G$ is
a finite group.

† Supported by the Deutsche Forschungsgemeinschaft.

In Section 3 we investigate the lattice of submodules of an absolutely
irreducible $H$-representation module $L$. In particular we characterize the
case where this lattice is distributive (see Theorem 3.19) and discuss some
restrictions on the lattice which can be derived from our proofs of the
finiteness of the class number. Necessary and sufficient conditions are given
for the class number of $L$ to be finite.

In Section 4 we study the behavior of absolutely irreducible
$H$-representation modules under finite extensions of the Dedekind domain $R$.

Finally, in Section 5 some examples of absolutely irreducible
$\mathbb{Z}$-representations of finite groups are discussed in order to give some
applications and illustrations of the developed tools.

The author is greatly indebted to Professor H. Zassenhaus for many
inspiring discussions and constant encouragement. He hereby wants to
thank him.


2 . 


Let $R$ be a Dedekind domain with quotient field $K$ and $H$ an $R$-order. If
two $H$-representation modules $M$ and $L$ are $K$-equivalent, then there exists a
submodule $N$ of $L$ that is $R$-equivalent to $M$. For let $\varphi\colon KM \to KL$ be a
$KH$-isomorphism. Because $M$ is finitely generated as an $R$-module there is an
$r \in R$, $r \neq 0$, with $r\varphi(M) \subseteq L$. But $r\varphi\colon M \to r\varphi(M) \subseteq L$ via $x \mapsto r\varphi(x)$ is
certainly an $H$-isomorphism. So one can choose $N = r\varphi(M)$. $N$ has the
following properties:

(i) $N$ is an $H$-submodule of $L$;

(ii) $L/N$ is an ($R$-)torsion module; or equivalently
(ii') $L$ and $N$ have the same $R$-rank.

Definition 2.1 A submodule $N$ of an $H$-representation module $L$ with
the properties (i) and (ii) is called a centering of $L$. The set of all centerings of
$L$ is denoted by $\mathfrak{Z}(L)$. (The term "centering" or "centered lattice" is used in
crystallography.)

We have shown above that a set of representatives of the $R$-classes into
which the $K$-class of $L$ splits can be chosen as a subset of $\mathfrak{Z}(L)$. We ask:
when are two centerings $M$ and $N$ of $L$ $R$-equivalent? Let $\varphi\colon M \to N$ be an
$H$-isomorphism. Then $\varphi$ can be extended to a $KH$-isomorphism of $KM$
onto $KN$, i.e., a $KH$-automorphism of $KL$, because $KM = KN = KL$. This
proves the following lemma.

Lemma 2.2 Two centerings $M$ and $N$ of $L$ are $R$-equivalent if and only if
there exists a $KH$-automorphism $\varphi$ of $KL$ with $\varphi(M) = N$.

We assume now that $L$ is absolutely irreducible. Then by Schur's lemma
the $KH$-automorphisms of $KL$ are just the multiplications with nonzero
elements of $K$. If $R$ is a principal ideal domain, this remark and the invariant
factor theorem suffice to prove the next proposition.

Proposition 2.3 Let $R$ be a principal ideal domain and $L$ an absolutely
irreducible $H$-representation module. A system of representatives of the
$R$-classes lying in the $K$-class of $L$ is given by the set $\mathfrak{R}(L)$ of all centerings of $L$
that are not contained in $rL$ for any nonunit $r$ of $R$.

Proof Compare Plesken [11] or Plesken and Pohst [12] for special
cases. The proof follows from the more general Theorem 2.5. QED

If $R$ is not a principal ideal domain, the most natural approach seems to
be to work with ideals instead of numbers.

Definition 2.4 Let $L$ be absolutely irreducible. $M, N \in \mathfrak{Z}(L)$ are called
ideal equivalent if there is a (fractional) ideal $\mathfrak{A}$ of $R$ with $M = \mathfrak{A}N$.

Hence $R$-equivalence implies ideal equivalence and ideal equivalence
implies $K$-equivalence. It is immediately clear from the definition that the
products $\mathfrak{A}M$ yield a system of representatives for the $R$-classes in $\mathfrak{Z}(L)$
where $\mathfrak{A}$ runs through a system of representatives of the ideal classes in $R$
and $M$ through a system of representatives of the ideal equivalence classes in
$\mathfrak{Z}(L)$.

Theorem 2.5 Let $L$ be an absolutely irreducible $H$-representation module.
A system $\mathfrak{R}(L)$ of representatives of the ideal equivalence classes in $\mathfrak{Z}(L)$ is
given by all those centerings of $L$ that are not contained in $\mathfrak{A}L$ for all integral
ideals $\mathfrak{A}$ of $R$ unequal to $R$. If $\mathfrak{I}$ is a set of representatives of the ideal classes of
the ideals of $R$, a system of representatives of the $R$-equivalence classes of the
$H$-modules that are $K$-equivalent to $L$ is given by $\{\mathfrak{A}M \mid \mathfrak{A} \in \mathfrak{I}$ and
$M \in \mathfrak{R}(L)\}$.

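Proof Only the first part of the theorem is left to prove. Let $M$ be a
centering of $L$. By the invariant factor theorem for $R$-modules there exist
elements $m_1, \ldots, m_n$ in $L$, fractional ideals $\mathfrak{F}_1, \ldots, \mathfrak{F}_n$ of $R$, and integral ideals
$\mathfrak{A}_1, \ldots, \mathfrak{A}_n$ of $R$ with $\mathfrak{A}_1 \supseteq \mathfrak{A}_2 \supseteq \cdots \supseteq \mathfrak{A}_n$ such that

$$L = \mathfrak{F}_1m_1 \oplus \mathfrak{F}_2m_2 \oplus \cdots \oplus \mathfrak{F}_nm_n$$

and

$$M = \mathfrak{F}_1\mathfrak{A}_1m_1 \oplus \mathfrak{F}_2\mathfrak{A}_2m_2 \oplus \cdots \oplus \mathfrak{F}_n\mathfrak{A}_nm_n$$

(with the $\mathfrak{A}_i$ uniquely determined). Then $\mathfrak{A}_1^{-1}M$ is in $\mathfrak{R}(L)$, showing that
$\mathfrak{R}(L)$ contains at least one centering of each ideal equivalence class. Now let
$M$ and $N$ be two ideal equivalent centerings of $\mathfrak{R}(L)$. Then $N = \mathfrak{B}M$ for
some fractional ideal $\mathfrak{B}$ of $R$. By using the above decomposition of $L$ and $M$
once more, we get $\mathfrak{A}_1 = R$ and $\mathfrak{B}\mathfrak{A}_1 = R$ because $M$ and $N$ are in $\mathfrak{R}(L)$.
Hence $\mathfrak{B} = R$ and $N = M$. QED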

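We can give a somewhat closer description of the set $\mathfrak{R}(L)$ if we use
primary decomposition of the factor modules $L/M$ with $M$ in $\mathfrak{R}(L)$.

Proposition 2.6 Every centering $M$ in $\mathfrak{R}(L)$ is the intersection of uniquely
determined centerings $M(\mathfrak{P})$ in $\mathfrak{R}(L)$ with $\mathfrak{P}$-power index in $L$, where $\mathfrak{P}$ is a
prime dividing the index of $M$ in $L$. Conversely, given finitely many elements
$M(\mathfrak{P})$ in $\mathfrak{R}(L)$ with $L : M(\mathfrak{P})$ a $\mathfrak{P}$-power for different prime ideals $\mathfrak{P}$ of $R$, then
the intersection of the $M(\mathfrak{P})$ is again in $\mathfrak{R}(L)$.

Proof Let $M \in \mathfrak{R}(L)$. The $M(\mathfrak{P})$ are easily obtained from a primary
decomposition of $L/M$. Let $L = \bigoplus_{i=1}^n \mathfrak{F}_im_i$ and $M = \bigoplus_{i=1}^n \mathfrak{A}_i\mathfrak{F}_im_i$ with
$m_i \in L$, $\mathfrak{F}_i$ fractional ideals of $R$ and $\mathfrak{A}_i$ integral ideals of $R$ with
$\mathfrak{A}_1 \supseteq \mathfrak{A}_2 \supseteq \cdots \supseteq \mathfrak{A}_n$.
Let $\mathfrak{A}_n = \mathfrak{P}_1^{\alpha_1} \cdots \mathfrak{P}_k^{\alpha_k}$. Then $M(\mathfrak{P}_i) = \mathfrak{P}_i^{\alpha_i}L + M$ for $i = 1, \ldots, k$.
Conversely, let $M(\mathfrak{P}_i)$ ($i = 1, \ldots, k$) be centerings in $\mathfrak{R}(L)$ of $\mathfrak{P}_i$-power index,
where the $\mathfrak{P}_i$ are different prime ideals. Let $M$ be the intersection of the
$M(\mathfrak{P}_i)$. $M$ is in $\mathfrak{R}(L)$ iff the first invariant factor ideal of $M$ in $L$ is equal to $R$.
But this factor ideal is the product of those of the $M(\mathfrak{P}_i)$, which are equal to
$R$ because the $M(\mathfrak{P}_i)$ are in $\mathfrak{R}(L)$. QED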

Let $R_{\mathfrak{P}}$ denote the ring of $\mathfrak{P}$-regular elements in $K$. By standard results
on localization of rings and modules one sees easily that for any centering $M$
of $L$ with $\mathfrak{P}$-power index we can view $L/M$ as an $R_{\mathfrak{P}}$-module such that

$$L/M \cong_{R_{\mathfrak{P}}H} \mathfrak{A}L/\mathfrak{A}M \quad \text{and} \quad R_{\mathfrak{P}}M \cong_{R_{\mathfrak{P}}H} R_{\mathfrak{P}}\mathfrak{A}M$$

for any ideal $\mathfrak{A}$ of $R$. Using these remarks, Theorem 2.5, and Proposition 2.6,
one easily derives Maranda's results [10] that (in our terminology) two
centerings of $L$ are ideal equivalent if and only if they are in the same genus,
i.e., their localizations at all prime ideals of $R$ are isomorphic, and that the
class number of $L$ is the product of the class numbers of $R$ and those of $R_{\mathfrak{P}}L$
for all primes $\mathfrak{P}$ of $R$. It is also clear now that the determination of $\mathfrak{R}(L)$ can
be reduced to that of the $\mathfrak{R}(R_{\mathfrak{P}}L)$ where $\mathfrak{P}$ runs through all primes such that
there is a centering between $L$ and $\mathfrak{P}L$ or equivalently such that $L/\mathfrak{P}L$
becomes reducible as an $H$-module. Every centering $M$ in $\mathfrak{R}(L)$ with
$\mathfrak{P}$-power index in $L$ is given by $R_{\mathfrak{P}}M \cap L$ with $R_{\mathfrak{P}}M$ in $\mathfrak{R}(R_{\mathfrak{P}}L)$. By
Proposition 2.6 the other centerings of $\mathfrak{R}(L)$ are found by taking intersections.
The problem of finding $\mathfrak{R}(R_{\mathfrak{P}}L)$ remains.

Our next result implies that all elements of $\mathfrak{R}(L)$ lie between $L$ and $aL$ for
a certain nonzero $a$ in $R$. This implies the finiteness of $\mathfrak{R}(L)$ if the nontrivial
factor rings of $R$ are finite. This is the Jordan-Zassenhaus theorem for the case
that $L$ is absolutely irreducible. If $L$ is an $H$-representation module, then
there is a homomorphism of $H$ into $\mathrm{End}_R(L)$, $\phi_L\colon H \to \mathrm{End}_R(L)$ with
$\phi_L(x)l = xl$ for all $x \in H$ and $l \in L$. Clearly $\phi_L(H)$ is a suborder of $\mathrm{End}_R(L)$,
which leaves exactly the same submodules of $L$ invariant as $H$. Furthermore,
the $R$-module $\mathrm{End}_R(L)/\phi_L(H)$ is a torsion module.

Theorem 2.7 Let $L$ be absolutely irreducible and let $\mathfrak{A}$ be the last
invariant factor ideal of $\phi_L(H)$ in $\mathrm{End}_R(L)$.

(i) If $M$ is a centering of $L$ with invariant factor ideals $\mathfrak{B}_1, \ldots, \mathfrak{B}_n$ in $L$,
then $\mathfrak{B}_n\mathfrak{B}_1^{-1}$ divides $\mathfrak{A}$. In particular all centerings of $\mathfrak{R}(L)$ lie between $L$ and
$\mathfrak{A}L$.

(ii) If $\mathfrak{P}$ is a prime ideal dividing $\mathfrak{A}$, then either there is a centering in
$\mathfrak{R}(L)$ with $\mathfrak{P}$-power index in $L$ or $L/\mathfrak{P}L$ is irreducible but not absolutely
irreducible as an $H/\mathfrak{P}H$-module. (By tensoring with a suitable extension ring
this last situation can be reduced to the first.) (The prime divisors of $\mathfrak{A}$ are
sometimes called critical primes.)

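Proof (i) By the invariant factor theorem there are elements $m_i \in L$,
fractional ideals $\mathfrak{F}_i$ of $R$ and integral ideals $\mathfrak{B}_i$ of $R$ for $i = 1, \ldots, n$ such that
$L = \bigoplus_{i=1}^n \mathfrak{F}_im_i$, $M = \bigoplus_{i=1}^n \mathfrak{F}_i\mathfrak{B}_im_i$ and $\mathfrak{B}_i \mid \mathfrak{B}_{i+1}$ for $i = 1, \ldots, n-1$.
Hence we can identify $\mathrm{End}_R(L)$ with the ring of matrices $\{(a_{kl}) \mid a_{kl} \in \mathfrak{F}_k^{-1}\mathfrak{F}_l$
$(k, l = 1, \ldots, n)\}$. By identifying $\mathrm{End}_R(M)$ with the corresponding ring of
matrices one sees that the last invariant factor of $\mathrm{End}_R(L) \cap \mathrm{End}_R(M)$ in
$\mathrm{End}_R(L)$ is $\mathfrak{B}_1^{-1}\mathfrak{B}_n$. Because $\phi_L(H)$ is contained in $\mathrm{End}_R(L) \cap \mathrm{End}_R(M)$, we
get $\mathfrak{B}_1^{-1}\mathfrak{B}_n \mid \mathfrak{A}$.

(ii) Let $\mathfrak{P} \mid \mathfrak{A}$. Then those endomorphisms of $L/\mathfrak{P}L$ induced by $H$
cannot be the full $R/\mathfrak{P}$-endomorphism ring of $L/\mathfrak{P}L$, which is isomorphic to
$(R/\mathfrak{P})^{n \times n}$. If there are proper $H$-submodules of $L/\mathfrak{P}L$, the first assertion
holds. If not, $L/\mathfrak{P}L$ is irreducible and the image of $H$ in $\mathrm{End}_{R/\mathfrak{P}}(L/\mathfrak{P}L)$ is
isomorphic to $D^{k \times k}$, where $D$ is a skew field containing $R/\mathfrak{P}$ properly by the
remark above. QED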

We remark that the last invariant factor of $\phi_L(H)$ in $\mathrm{End}_R(L)$ depends on
the $R$-class of $L$. If $H$ is the group ring $RG$ of a finite group $G$ such that the
characteristic of $R$ does not divide $|G|$, then we can prove by using Schur's
relations [5, 8] that all centerings in $\mathfrak{R}(L)$ lie between $L$ and $(|G|/n)L$, where
$n$ is the $R$-rank of $L$. In Brauer [2] and Brauer and Nesbitt [3], Schur's
relations have been used in a similar way.

Theorem 2.8 Let $G$ be a finite group and $L$ an absolutely irreducible
$RG$-representation module, where the characteristic of $R$ does not divide $|G|$.
If $M$ is a centering of $L$ with invariant factor ideals $\mathfrak{B}_1, \ldots, \mathfrak{B}_n$ ($n$ = $R$-rank of
$L$), then $|G|/n$ lies in $\mathfrak{B}_n\mathfrak{B}_1^{-1}$. In particular all centerings in $\mathfrak{R}(N)$ lie between
$N$ and $(|G|/n)N$ for any $N \in \mathfrak{Z}(L)$.

Proof As in the proof of Theorem 2.7 let L= ®” =1 5, m i and 
M = ®" =1 ©, m t . Let D be the K-representation of G belonging to KL 
with respect to the basis m u ..., m n . If D(g) = (d^g)) for g e G, then Schur’s 
relations assure that 


H geG 

The invariance of the lattices L and M respectively implies d st (h) g 5 s 1 
and d st (h) e © s 5 S ® f ~ 11 f° r all h e G and 1 < s, t < n. By taking h = g, 
s = U t=j, and h = g~ l , s — j, t = i we get 

\G\/ne djdr u £j l< 5f u Bi % = 8,»7 ^ 

This implies the theorem if we choose i = n and 7=1. QED 

Corollary 2.9 If under the assumption of Theorem 2.8 $|G|/n$ is a unit in
$R$, then $\mathfrak{R}(L)$ consists only of $L$.

Corollary 2.10 Under the assumption of Theorem 2.8 the $K$-class of $L$
splits into finitely many $R$-classes if and only if the ideal class group of $R$ is
finite.

It is clear that Theorem 2.8 imposes stronger restrictions on
$RG$-representation modules concerning the finiteness of the class number than
Theorem 2.7 does for $H$-representation modules. For instance, Theorem 2.8
implies that the class number of an irreducible $\mathbb{C}[x]G$-representation module
is 1, whereas there are certainly $\mathbb{C}[x]$-orders $H$ with modules having an
infinite class number. For instance,


$$H = \mathbb{C}[x]\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} + x\,\mathbb{C}[x]^{2 \times 2}$$

has $L = \mathbb{C}[x]^{2 \times 1}$ as a representation module in the obvious way. The class
number is infinite; in fact those centerings of $\mathfrak{R}(L)$ that lie properly between
$L$ and $xL$ are in one-to-one correspondence with the points of the complex
projective line. In Corollary 3.9 we shall be able to give necessary and
sufficient conditions for an absolutely irreducible $H$-representation module
to have a finite $\mathfrak{R}(L)$.

3. 


In this section we want to investigate the lattice $\mathfrak{Z}(L)$ of centerings of $L$.
Throughout the whole section we assume $R$ to be a principal ideal domain if
not otherwise stated. (This is legitimate because of the results of the
preceding section.) We shall first prove a somewhat stronger version of Brauer and
Nesbitt's result that the composition factors of $M/pM$ for $M \in \mathfrak{Z}(L)$ are the
same as those of $L/pL$, where $p$ is a prime element of $R$. We need not assume
irreducibility for this.

Definition 3.1 Let $M$, $M'$, $N$, $N'$ be centerings of $L$ with $M' \subseteq M$ and
$N' \subseteq N$.

(i) The pair $(M, M')$ is projectively neighbored to $(N, N')$, if either

(a) $M \subseteq N$ and $M' = M \cap N'$ (Fig. 1) or

(b) $N \subseteq M$ and $N' = N \cap M'$.

(ii) $(M, M')$ and $(N, N')$ are projectively related if there is a chain of
pairs of centerings $(L_1, L_1'), (L_2, L_2'), \ldots, (L_k, L_k')$ with $(L_1, L_1') = (M, M')$,
$(L_k, L_k') = (N, N')$ such that $(L_i, L_i')$ is either projectively neighbored to
$(L_{i+1}, L_{i+1}')$ or there is an $a \in K$ with $(L_{i+1}, L_{i+1}') = (aL_i, aL_i')$.

It is clear that "being projectively related" is an equivalence relation on
the pairs $(M, M')$ of centerings of $L$ with $M' \subseteq M$. If $(M, M')$ and $(N, N')$ are
projectively related, then $M/M'$ and $N/N'$ are isomorphic. The converse is
not always true. The first fundamental result is the following theorem
generalizing the Jordan-Hölder theorem for our situation.

Theorem 3.2 Let $L$ be a (not necessarily irreducible) $H$-representation
module with $R$ a principal ideal domain, let $p$ be a prime element in $R$, and let $M$
be a centering of $L$. For any two $H$-composition series

$$L_1 = L > L_2 > \cdots > L_k = pL \quad \text{and} \quad M_1 = M > M_2 > \cdots > M_l = pM$$

of $L/pL$ and $M/pM$ respectively, $l = k$ holds and there exists a permutation $\pi$ of
the indices such that $(L_i, L_{i+1})$ is projectively related to $(M_{\pi(i)}, M_{\pi(i)+1})$ for
$i = 1, \ldots, k - 1$.

Proof Any two composition series of $N/pN$ are projectively related in
the above sense for any centering $N$ of $L$, which is easily seen from the
Schreier-Zassenhaus proof of the Jordan-Hölder theorem. (Note that $R/pR$ is a
field.) It suffices to prove the theorem under the assumption that $L/M$ is a
simple $H$-module. If $M \nsupseteq pL$, the result is trivial. If $M \supseteq pL$, we choose
$L_2 = M$ and $M_{l-1} = pL$. Then one sees immediately that $(L, M)$ is
projectively related to $(pL, pM)$ and $(M, pL)$ to itself (Fig. 2). The theorem
follows. QED

[Fig. 2: the chain of centerings $L \supset M \supset pL \supset pM$]


As a corollary we get the well-known result first proved by Brauer and
Nesbitt under the assumption that $H$ is a group ring [4].

Corollary 3.3 Under the assumption of Theorem 3.2 the irreducible
constituents of the $R/pR \otimes_R H$-representations belonging to $L/pL$ and $M/pM$ are the
same.

This corollary is the basis for the following algorithm developed by
Plesken [11] to compute $\mathfrak{R}(L)$ for absolutely irreducible $L$ (if $\mathfrak{R}(L)$ is finite).
Assume that $A_1, \ldots, A_k$ are the irreducible constituents of $L/pL$ for all
primes $p$ for which $L/pL$ becomes reducible. (These are only finitely many
primes by Theorem 2.7.) From any $M \in \mathfrak{Z}(L)$ new centerings are obtained as
the kernels of all possible epimorphisms of $M$ onto the $A_i$. Determining these
epimorphisms as well as their kernels amounts to solving systems of linear
equations over the field $R/pR$ with $p$ as above. Starting out with this
procedure from $L$, one has to test for each new centering $N$ whether it lies in
$\mathfrak{R}(L)$ (and does not have to compare it with all the previous ones one has
obtained), i.e., one has to check whether the first invariant factor of $N$ in $L$ is
a unit. This procedure eventually yields all centerings of $\mathfrak{R}(L)$ (if $\mathfrak{R}(L)$ is
finite). For details, the reader is referred to Plesken [11] or Plesken and
Pohst [12].
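
The membership test in this procedure is cheap in the simplest case. The sketch below assumes $R = \mathbb{Z}$ and that a centering $N$ is described by a nonsingular integer matrix whose rows express a basis of $N$ in a basis of $L$; both the encoding and the function names are assumptions made for illustration. The first invariant factor of an integer matrix is the greatest common divisor of all its entries, so $N$ lies in $\mathfrak{R}(L)$ exactly when that gcd is 1.

```python
from functools import reduce
from math import gcd


def first_invariant_factor(basis_matrix):
    """First invariant factor of an integer matrix: the gcd of all of its entries."""
    return reduce(gcd, (abs(x) for row in basis_matrix for x in row), 0)


def lies_in_representatives(basis_matrix):
    """True if the centering with this basis matrix is not contained in rL for any r > 1."""
    return first_invariant_factor(basis_matrix) == 1
```

For example, the centering with basis matrix `[[2, 0], [0, 4]]` is contained in $2L$ and is rejected, while `[[1, 0], [0, 2]]` is accepted.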

Before we proceed investigating the lattice $\mathfrak{Z}(L)$ we make two simple
remarks.

(1) From the idea of the algorithm it is clear that $\mathfrak{R}(L)$ consists of $L$
alone iff for all primes $p$ of $R$ the module $L/pL$ stays irreducible. Let $R$ have
the property that for all prime ideals of $R$ the factor field has the same
characteristic as $R$. If $H = RG$ for a finite group $G$, then $|\mathfrak{R}(L)| = 1$
independent of the characteristic of $R$. More generally, if an order $H'$ over
any Dedekind domain $R' \supseteq R$ with an absolutely irreducible representation
module $L'$ contains an $R$-order $H$ with an absolutely irreducible representation
module $L$ such that $H' = R' \otimes_R H$ and $L' = R' \otimes_R L$, then $\mathfrak{R}(L')$
consists only of $L'$ in case $R$ is a field.

(2) The second remark refers to the ideas developed by Schur in [13].
We assume $R$ is a Dedekind domain. One can predict the Steinitz invariants
of the centerings of $L$ if one knows the Steinitz invariant of $L$ and the indices
of all centerings in $\mathfrak{R}(R_{\mathfrak{P}}L)$ for all those critical primes $\mathfrak{P}$ which are not in an
ideal class of the $n$th power of some other ideal ($n$ = $R$-rank of $L$). If for
instance the exponent of the ideal class group of $R$ divides the $R/\mathfrak{P}$-rank of
all irreducible constituents of $L/\mathfrak{P}L$ for all critical primes $\mathfrak{P}$, then all
centerings of $L$ have the same Steinitz invariant. It would be interesting to find a
Dedekind ring $R$, a finite group $G$, and an $RG$-representation module $L$ such
that the Steinitz invariants of all centerings of $L$ are not principal, i.e., the
matrix representation of $KL$ cannot be transformed into one where all
matrix entries lie in $R$.

We come back to the lattice $\mathfrak{Z}(L)$ and assume $L$ to be absolutely
irreducible of $R$-rank $n$ and $R$ to be a principal ideal domain. The lattice is
modular and by Proposition 2.6 splits into the direct product of the
sublattices $\mathfrak{Z}_p(L)$ of the $p$-centerings, as we want to call the centerings with $p$-power
index in $L$ ($p$ a prime of $R$). For noncritical primes $p$, the $p$-centerings of $L$ are
given by $p^{\alpha}L$ with $\alpha \in \mathbb{Z}_{\ge 0}$. Further $\mathfrak{Z}(L)$ is periodic in the sense that $\mathfrak{Z}(L)$ is
the disjoint union of all $\mathfrak{A}\mathfrak{R}(L)$ with $\mathfrak{A}$ running through all integral ideals of
$R$. We first want to discuss restrictions imposed on $\mathfrak{Z}(L)$ by Theorems 2.7
and 2.8.

Definition 3.4 The biggest $\alpha \in \mathbb{Z}_{\ge 0}$ such that $p^{\alpha}$ is the last invariant
factor of $M$ in $N$ with $M, N \in \mathfrak{Z}_p(L)$ and $M \in \mathfrak{R}(N)$ is called the critical
$p$-exponent of $L$ for any prime $p$ of $R$.

We note that Theorem 2.8 yields an upper bound for the critical
$p$-exponent in the case $H = RG$, namely the biggest $\alpha$ with $p^{\alpha} \mid (|G|/n)$.
Theorem 2.7, however, does not give such a bound immediately. But using
Theorem 2.7 and the kind of argument used in its proof, one gets $2\alpha$ as an
upper bound for the critical $p$-exponent, where $p^{\alpha}$ divides the last invariant
factor of $\phi_L(H)$ in $\mathrm{End}_R(L)$ but $p^{\alpha+1}$ does not (compare Theorem 2.7 for the
terminology). Hence the critical $p$-exponent is always well defined.
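
As a small worked illustration of the group ring bound, the following sketch computes the biggest $\alpha$ with $p^{\alpha} \mid (|G|/n)$. It assumes $R = \mathbb{Z}$, so that $p$ is a rational prime, and that $n$ divides $|G|$; the function name is hypothetical.

```python
def group_ring_exponent_bound(group_order, rank, p):
    """Biggest alpha with p**alpha dividing |G|/n (bound from Theorem 2.8, R = Z assumed)."""
    quotient, remainder = divmod(group_order, rank)
    if remainder:
        raise ValueError("this sketch assumes that n divides |G|")
    alpha = 0
    while quotient % p == 0:
        quotient //= p
        alpha += 1
    return alpha
```

For the two-dimensional absolutely irreducible representation of the symmetric group on three letters ($|G| = 6$, $n = 2$) this gives the bound 1 for $p = 3$ and 0 for $p = 2$, so only $p = 3$ can be critical.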

Theorem 3.5 Let $M, N \in \mathfrak{Z}_p(L)$ for some prime $p$ of $R$ such that $M, N \neq M + N$.
Then $M, N \in \mathfrak{R}(M + N)$. If $p^{\alpha}$ and $p^{\beta}$ are the last invariant factors of
$M$, resp. $N$, in $M + N$, then $\alpha + \beta$ is smaller than or equal to the critical
$p$-exponent of $L$.

Proof The first statement follows easily if one chooses compatible bases
for $N$ and $M$ and uses these bases to construct a basis for $N + M$ in an
obvious way. From the second isomorphism theorem we conclude that $\alpha$ is
the smallest natural number such that $p^{\alpha}N \subseteq M$, hence $p^{\alpha}N \in \mathfrak{R}(M)$ (Fig. 3).

[Fig. 3]

Clearly $p^{\alpha+\beta}(M/p^{\alpha}N) = 0$. If $p^{\gamma}M \subseteq p^{\alpha}N$ with $\gamma < \alpha + \beta$, this would
contradict the definition of $\beta$ as the $p$-exponent of the last invariant factor of

$$(N + M)/N \cong M/(M \cap N) \cong (M/p^{\gamma-\alpha}M)/((M \cap N)/p^{\gamma-\alpha}M). \quad \text{QED}$$

As a first corollary we get a result first proved by Brauer in [2].

Corollary 3.6 Let $H$ be a group ring $RG$ and $p$ be a prime of $R$. If
$p \mid (|G|/n)$ but $p^2 \nmid (|G|/n)$, then the $p$-centerings are linearly ordered and
$\mathfrak{R}(L) \cap \mathfrak{Z}_p(L)$ consists of those $M \in \mathfrak{Z}(L)$ with $pL \subsetneq M \subseteq L$.

We want to ask now: how many maximal centerings $N_i$ of $N \in \mathfrak{Z}(L)$ can
exist such that the $N/N_i$ are all isomorphic to each other?

Lemma 3.7 Let $N \in \mathfrak{Z}(L)$ and let $A$ be an irreducible constituent of $L/pL$.
Then $\mathrm{Hom}_H(N, A)$ is a vector space over $\mathrm{End}_H(A)$, the dimension of which is
bounded by the multiplicity of $A$ in $L/pL$. The centerings $M$ of $N$ with
$N/M \cong \bigoplus_{i=1}^k A$ are in one-to-one correspondence with the $k$-dimensional
subspaces of $\mathrm{Hom}_H(N, A)$ (over $\mathrm{End}_H(A)$).

Proof For $\varphi \in \mathrm{Hom}_H(N, A)$ and $\alpha \in \mathrm{End}_H(A)$, we get

$$\alpha \circ \varphi \in \mathrm{Hom}_H(N, A).$$

Since $A$ is irreducible, $\mathrm{End}_H(A)$ is a skew field. So $\mathrm{Hom}_H(N, A)$ is an
$\mathrm{End}_H(A)$-vector space. Now the rest of the statements follows easily. QED

Corollary 3.8 If under the assumption of Lemma 3.7 the field $R/pR$ is
finite of $q$ elements, then the number of the $M \in \mathfrak{Z}(N)$ with $N/M \cong A$ is of the
form $(q^{\alpha\beta} - 1)/(q^{\alpha} - 1)$ with $\alpha = \dim_{R/pR} \mathrm{End}_H(A)$ and $\beta \le$ multiplicity of $A$ in
$L/pL$.
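
The count in Corollary 3.8 is the number of one-dimensional subspaces of a $\beta$-dimensional vector space over a field with $q^{\alpha}$ elements. The following sketch evaluates the formula and, for a prime $q$ and $\alpha = 1$, compares it with a brute-force count of normalized vectors in $\mathbb{F}_q^{\beta}$. The function names are hypothetical, and the brute-force check covers prime fields only.

```python
from itertools import product


def count_by_formula(q, alpha, beta):
    """(q^(alpha*beta) - 1) / (q^alpha - 1): lines in a beta-dimensional space over GF(q^alpha)."""
    return (q ** (alpha * beta) - 1) // (q ** alpha - 1)


def count_lines_brute_force(q, beta):
    """Count lines in GF(q)^beta for a prime q by listing one normalized vector per line."""
    count = 0
    for v in product(range(q), repeat=beta):
        first_nonzero = next((x for x in v if x), None)
        if first_nonzero == 1:   # every line contains exactly one such vector
            count += 1
    return count
```

With $q = 3$, $\alpha = 1$, $\beta = 2$ both functions give 4, the number of points of the projective line over the field with three elements.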

Corollary 3.9 Let the fields $R/pR$ be infinite for the critical primes $p$ of $R$.
Then $\mathfrak{R}(L)$ is finite if and only if for any two $N, M \in \mathfrak{Z}(L)$ with $N \subset M$ and $N$
maximal in $M$ there is no other centering $N'$ of $M$ with $M/N \cong M/N'$.

Proof Follows immediately from Lemma 3.7 and Theorem 2.7. QED

As an important consequence we remark that there is at most one
centering $M$ of $N$ with $N/M \cong A$ in case the multiplicity of $A$ in $L/pL$ is one. We
want to prove that there always exists a centering $N$ of $L$ that does not only
have the property that there is only one centering $M \subseteq N$ with $N/M \cong A$ but
also has $M$ as the only maximal $p$-centering. We can even do better than this
by introducing a new type of constituent of $L/pL$ using the ideas of
Definition 3.1 and Theorem 3.2.

Definition 3.10 Let $N$ be a centering of $L$ and let $N_1 = N > N_2 > \cdots >
N_k = pN$ be a composition series of $N/pN$. We call the projective
equivalence classes of the pairs $(N_i, N_{i+1})$ the projective composition factors of
$N/pN$ ($i = 1, \ldots, k - 1$). The multiplicity of such a composition factor is the
number of the indices $i$ ($i = 1, \ldots, k - 1$) with $(N_i, N_{i+1})$ lying in this
equivalence class.

It is clear from Theorem 3.2 that the definition of projective composition
factor does not depend on the chosen centering $N$. If $(N, M)$ belongs to a
projective composition factor of $L/pL$, then $N/M$ is (an ordinary)
composition factor of $L/pL$. However, $(N_1, M_1)$ and $(N_2, M_2)$ may belong to
different projective composition factors of $L/pL$ though $N_1/M_1$ is isomorphic to
$N_2/M_2$. So in general there are more projective composition factors than
ordinary ones and the multiplicities of the projective ones are smaller. It is
clear that both concepts are the same if $L/pL$ does not have repeated
(ordinary) composition factors.

Theorem 3.11 Let $(M, M')$ belong to a projective composition factor of
$L/pL$. Then there is a $p$-centering $N$ of $L$ such that $N$ has exactly one maximal
$p$-centering $N'$. Further, $(N, N')$ is projectively equivalent to $(M, M')$.

Proof Let $N$ be a $p$-centering of $M$ with the following properties: (i)
$N \nsubseteq M'$; (ii) there is no $p$-centering $\tilde N$ of $M$ properly contained in $N$ with property (i).
By Theorem 3.5 each $N$ with (i) is contained in $\mathfrak{R}(M)$. By Theorem 2.7 there
exists an $N$ with (i) and (ii) (though it might not be unique). $N$ satisfies the
assertion of the theorem (possibly after multiplication with an element of $K$
to make it a $p$-centering). Take $N' = M' \cap N$. If there were a maximal
$p$-centering $N''$ of $N$ with $N'' \neq N'$, then $N''$ would be a $p$-centering of $M$ satisfying
(i); hence $N$ would not satisfy (ii), which is a contradiction. QED

We now want to investigate those lattices of centerings that have the 
property that for any projective composition factor and any centering M 
there is at most one centering M' of M such that (M, M') belongs to the 
given composition factor. 

Proposition 3.12 The lattice of centerings of L is distributive if and only if 
it satisfies the following condition: 

(*) For any projective composition factor C of L and any centering M of 
L there exists at most one centering M' of M with (M, M') belonging to C. 

Proof This is an immediate corollary of Birkhoff's theorem [1] that a
modular lattice is distributive if and only if it does not contain sublattices of
the form shown in Fig. 4. Another proof follows from Lemma 3.16. QED



Fig. 4 


Proposition 3.13 Let $L$ satisfy condition (*) of Proposition 3.12. If
$M, N, N' \in \mathfrak{Z}(L)$ with $N, N' \subseteq M$ such that $M/N$ and $M/N'$ have the same
projective composition factors (with the same multiplicities), then $N = N'$.

Proof We use induction on the number of composition factors of $M/N$.
If this number is one, the proposition follows from (*) immediately. Assume the
proposition is true if $M/N$ has fewer than $m$ composition factors (multiplicities
counted!). If there is a centering $M'$ of $M$, unequal to $M$ and containing both
$N$ and $N'$, we can apply the induction hypothesis. Let

$$M = N_0 > N_1 > \cdots > N_m = N$$

$$M = N'_0 > N'_1 > \cdots > N'_m = N'$$

be composition series of $M/N$ and $M/N'$. Let $i$ be the smallest index such
that $(N'_i, N'_{i+1})$ belongs to the same projective composition factor as
$(N_0, N_1)$. Then the composition series of $M/N'$ can be replaced by
$M = N''_0 > N''_1 > \cdots > N''_m = N'$ where $N''_j = N'_{j-1} \cap N_1$ for $j = 1, 2,
\ldots, i$ and $N''_j = N'_j$ for $j = i + 1, \ldots, m$. Note that $N'_{i+1} = N'_i \cap N_1$. Because
of $N''_1 = N_1$ the induction hypothesis can be applied. QED

Proposition 3.14 Under assumption (*) of Proposition 3.12 the projective
composition factors of $L/pL$ have multiplicity one.

Proof Let $L = L_1 > L_2 > \cdots > L_k = pL$ be a composition series of
$L/pL$. Let $(L_i, L_{i+1})$ be projectively equivalent to $(L_j, L_{j+1})$ with $i < j$. Then
there is a chain $(N_1, N_1'), (N_2, N_2'), \ldots, (N_s, N_s')$ of pairs of centerings such
that $(N_1, N_1') = (L_i, L_{i+1})$ and $(N_s, N_s') = (p^{\alpha}L_j, p^{\alpha}L_{j+1})$ for some $\alpha \in \mathbb{Z}_{\ge 0}$ and
$(N_k, N_k')$ is a projective neighbor of $(N_{k+1}, N_{k+1}')$. But (*) implies that
$(N_k, N_{k+1})$ or $(N_{k+1}, N_k)$ (respectively if $N_k > N_{k+1}$ or if $N_{k+1} > N_k$) is not
projectively equivalent to $(L_i, L_{i+1})$. Hence if $M$ is a centering containing
all the $N_k$, the multiplicity of $(L_i, L_{i+1})$ in $M/N_t$ is the same for all $N_t$. So
$(L_i, L_{i+1})$ cannot be projectively equivalent to $(L_j, L_{j+1})$. QED

Definition 3.15 Assume (*) of Proposition 3.12 and let 
$L = L_1 > L_2 > \dots > L_{k+1} = pL$ be a fixed composition series of $L/pL$. Let M, N be 
p-centerings of L with $M \subseteq N$. We denote M by $N(\alpha_1, \dots, \alpha_k)$ where the 
$\alpha_i$ ($i = 1, \dots, k$) are the multiplicities of the projective composition factor 
$(L_i, L_{i+1})$ in $N/M$. 

Lemma 3.16 Under the assumption (*) of Proposition 3.12 and in the 
terminology of Definition 3.15 the following hold: 

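(i) $p^{\alpha} N(\alpha_1, \dots, \alpha_k) = N(\alpha + \alpha_1, \dots, \alpha + \alpha_k)$ for $\alpha \in \mathbf{Z}_{\ge 0}$; 

(ii) $N(\alpha_1, \dots, \alpha_k) \cap N(\beta_1, \dots, \beta_k) = N(\gamma_1, \dots, \gamma_k)$ with $\gamma_i = \max(\alpha_i, \beta_i)$ 
($i = 1, \dots, k$); 

(iii) $N(\alpha_1, \dots, \alpha_k) + N(\beta_1, \dots, \beta_k) = N(\delta_1, \dots, \delta_k)$ with $\delta_i = \min(\alpha_i, \beta_i)$ 
($i = 1, \dots, k$); 

(iv) $N(\alpha_1, \dots, \alpha_k)(x_1, \dots, x_k) = N(\alpha_1 + x_1, \dots, \alpha_k + x_k)$; 

(v) $N(\alpha_1, \dots, \alpha_k) \in \mathfrak{R}(N)$ iff at least one $\alpha_i = 0$; 

where $N(\alpha_1, \dots, \alpha_k)$, $N(\beta_1, \dots, \beta_k)$, and $N(\alpha_1, \dots, \alpha_k)(x_1, \dots, x_k)$ are 
p-centerings of N. 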

Proof (i) and (iv) are trivial. We prove (ii) and (iii), i.e., $\gamma_i = \max(\alpha_i, \beta_i)$ 
and $\delta_i = \min(\alpha_i, \beta_i)$. From the second isomorphism theorem we see 
immediately that the first set of equations implies the second. To prove the first 
equations one proceeds similarly to the proof of Proposition 3.13 by using 
induction on $\sum \alpha_i + \sum \beta_i$. QED 
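
The rules of Lemma 3.16 say that, under (*), p-centerings behave like exponent vectors with componentwise operations. The following Python lines are only an illustrative sketch of that bookkeeping; the function names are our own, and a tuple of non-negative integers stands in for a centering $N(\alpha_1, \dots, \alpha_k)$.

```python
from typing import Tuple

Exponents = Tuple[int, ...]  # stands for N(alpha_1, ..., alpha_k); illustrative only


def scale(a: int, alpha: Exponents) -> Exponents:
    """Rule (i): multiplying by p^a adds a to every multiplicity."""
    return tuple(a + x for x in alpha)


def meet(alpha: Exponents, beta: Exponents) -> Exponents:
    """Rule (ii): the intersection has the componentwise maximum."""
    return tuple(max(x, y) for x, y in zip(alpha, beta))


def join(alpha: Exponents, beta: Exponents) -> Exponents:
    """Rule (iii): the sum has the componentwise minimum."""
    return tuple(min(x, y) for x, y in zip(alpha, beta))


def is_representative(alpha: Exponents) -> bool:
    """Rule (v): a representative has at least one multiplicity equal to 0."""
    return min(alpha) == 0


def distributive_on(alpha: Exponents, beta: Exponents, gamma: Exponents) -> bool:
    """Check the distributive law on one triple of exponent vectors."""
    left = meet(alpha, join(beta, gamma))
    right = join(meet(alpha, beta), meet(alpha, gamma))
    return left == right
```

Since max and min distribute over each other componentwise, the distributive law holds for every triple, which is the content of the remark that Lemma 3.16 gives another proof of Proposition 3.12.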

Our next aim is to describe the biggest R-order in $\mathrm{End}_K(KL)$ that leaves 
all the centerings of L invariant. Of course it is given by $\bigcap_{M \in \mathfrak{R}(L)} \mathrm{End}_R(M)$ 
where we view the $\mathrm{End}_R(M)$ as orders of $\mathrm{End}_K(KL)$. We shall prove that this 
is a graduated order as introduced by Zassenhaus [14] iff $\mathfrak{Z}(L)$ satisfies (*) of 
Proposition 3.12. 

Definition 3.17 Let R be a local Dedekind ring with prime element p. A 
graduated order over R is a suborder of $R^{n \times n}$ containing n orthogonal 
primitive idempotents $e_1, \dots, e_n$ with $e_1 + \dots + e_n = I_n$ (= unit matrix). 

We give a short summary of the properties of graduated orders [14]. 

Theorem 3.18 Let A be a graduated order contained in $R^{n \times n}$ with primitive 
idempotents $e_1, \dots, e_n$ (and R local). A can be transformed in such a way 
that the $e_i$ become the diagonal matrices diag(0, ..., 0, 1, 0, ..., 0) with the 1 in 
the ith place; and further there are natural numbers $n_1, \dots, n_k$ with 
$n_1 + \dots + n_k = n$ and $\alpha_{ij} \in \mathbf{Z}_{\ge 0} \cup \{\infty\}$ ($i, j = 1, \dots, k$) such that $\alpha_{ij} = \infty$ only if 
$i \ne j$ and $\alpha_{ii} = 0$ and 

\[ A = \{ (p^{\alpha_{ij}} a_{ij}) \mid a_{ij} \in R^{n_i \times n_j} \} \]

where $p^{\infty}$ stands for zero. The $\alpha_{ij}$ satisfy the obvious conditions 

\[ \alpha_{ij} + \alpha_{jl} \ge \alpha_{il} \quad (i, j, l = 1, \dots, k) \]

which are necessary and sufficient for A to be closed under multiplication. 
$KA = K^{n \times n}$ iff all $\alpha_{ij} \in \mathbf{Z}_{\ge 0}$. In this case the submodules (centerings) of $R^{n \times 1}$ 
invariant under left multiplication are given by 

\[ R^{n \times 1}(\alpha_1, \dots, \alpha_k) = \left\{ \begin{pmatrix} p^{\alpha_1} a_1 \\ \vdots \\ p^{\alpha_k} a_k \end{pmatrix} \;\middle|\; a_j \in R^{n_j \times 1} \text{ for } j = 1, \dots, k \right\} \quad \text{with } \alpha_{ij} + \alpha_j \ge \alpha_i. \]
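
The closure condition on the exponents in Theorem 3.18 is easy to test mechanically. The sketch below is illustrative only: the function name is ours, an exponent matrix is assumed to be given as a list of rows, and `math.inf` plays the role of the symbol ∞.

```python
from itertools import product
from math import inf


def is_closed_under_multiplication(alpha):
    """Check alpha[i][j] + alpha[j][l] >= alpha[i][l] for all index triples.

    alpha is a k x k list of rows with entries in {0, 1, 2, ...} or math.inf.
    """
    k = len(alpha)
    return all(
        alpha[i][j] + alpha[j][l] >= alpha[i][l]
        for i, j, l in product(range(k), repeat=3)
    )
```

For example, the exponent matrix with rows (0, 0, 0), (1, 0, 0), (2, 1, 0) passes the check, while replacing the entry 2 by 3 fails it because 1 + 1 < 3.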

If R is a nonlocal principal ideal domain, then a suborder of $R^{n \times n}$ is 
called a graduated order if it becomes a graduated order by localization at 
all primes p of R. Theorem 3.18 can be generalized in an obvious way for this 
situation. From Theorem 3.18 we see that the centerings of an irreducible 
graduated order form a distributive lattice because the rules of Lemma 3.16 
are fulfilled. We also have compatible bases for the centerings of $R^{n \times 1}$ in the 
sense of Zassenhaus [17], i.e., one gets a basis of each centering by 
multiplying the elements of the basis 

\[ (1, 0, \dots, 0)^T, \; (0, 1, 0, \dots, 0)^T, \; \dots, \; (0, \dots, 0, 1)^T \]

by certain elements of R. We can now characterize the modules satisfying (*) 
in Proposition 3.12. 

Theorem 3.19 Let L be an absolutely irreducible representation module of 
an R-order H where R is a principal ideal domain. The following statements are 
equivalent: 

(i) L satisfies condition (*) in Proposition 3.12. 

(ii) The lattice $\mathfrak{Z}(L)$ of centerings of L is distributive. 

(iii) For all prime elements p of R the multiplicity of the projective 
composition factors of $L/pL$ is equal to one. 

(iv) There exists a system of compatible bases for the centerings of L. 

(v) The maximal suborder of $\mathrm{End}_R(L)$ leaving all the centerings of L 
invariant is a graduated R-order of $K^{n \times n}$. 





Proof In Proposition 3.12 we proved already that (i) and (ii) are 
equivalent. By Proposition 3.14 statement (i) implies (iii); the converse of this is 
trivial. By Theorem 3.18 the last two statements are equivalent and also by 
Theorem 3.18 one sees that (v) implies (i). We show that (i) implies (iv). In 
Zassenhaus [17] necessary and sufficient conditions for the existence of 
compatible bases for systems of submodules of a finitely generated abelian group 
are given. The results of Zassenhaus [17] can easily be generalized to finitely 
generated modules over principal ideal domains. To 
prove (iv) we then have to check the following conditions for 3(L) = Z: 

(a) 3(L) is distributive. 

(b) For any prime p of R , let Z p be the set of all factor modules M/p*L, 
where a is the critical p-exponent of L and M e 3(L) with p a L ^ M. 

(bj) n m 2 ) = p^m x n p fi m 2 for all p e Z~° and m i9 m 2 e'L p . 

(b 2 ) Let m e Z, m! — £l ( p v m ") n m and define (p _v , Z)m to be the 
intersection of all m" e Z with p v m" = m'. For any v e Z~° the operator 
(p _v , Z) has to satisfy the (p _1 , Z)(p _v , Z )m = (p“ (v+1) , Z )m for all m e Z. 

Condition (a) is satisfied because of Proposition 3.12. One easily checks (b^ 
by means of Lemma 3.16. We check (b 2 ). Let m = L(a 1? ..., (x k )/p a L e Z. If 
v > a we get (p“ v , Z)m = p a L/p a L. If v < a, one sees easily (p" v , Z )m = L(f u 
..., jS k )/p a / with pi = max(v, a f ) — v. (b 2 ) now follows easily. QED 

The following application of Theorem 3.19 which is already partly 
proved in Zassenhaus [17] might be of interest. 

Corollary 3.20 Let R be a principal ideal domain, L a finitely generated 
free R-module , and let 931 be a finite set of submodules of L of the same R-rank 
as L. Then there exist compatible bases for L and the submodules in 9JI if and 
only if the lattice SR is distributive, where SR is generated by L and the elements 
of 93? by taking intersections, sums, and by multiplying with elements of the 
quotient field of R such that the resulting modules are contained in L 

Proof Let $H = \mathrm{End}_R(L) \cap \bigcap_{M \in \mathfrak{M}} \mathrm{End}_R(M)$, where $\mathrm{End}_R(M)$ is 
considered to be an order in $\mathrm{End}_K(KM)$. Then H is an R-order and L an 
absolutely irreducible H-representation module with $\mathfrak{M} \subseteq \mathfrak{Z}(L)$. That the 
existence of compatible bases implies the distributivity of 93? was already 
observed in [17]. (This can also be obtained by applying Theorem 3.19.) As 
for the converse we apply a slight generalization of Theorem 3.19. One sees 
easily that the distributivity of 93? implies condition (*) of Proposition 3.12 
for 931. From this one concludes that the results in Propositions 3.13 and 3.14 
and Lemma 3.16 are also valid for 93?, as well as the version of Theorem 3.19 
for 93? instead of 3(L). If at some place the finiteness of the critical p- 
exponents is used, the arguments are still valid, since these exponents are 
also finite for 93? because of 93? ^ 3(L). Now this version of Theorem 3.19 
shows that there is a system of compatible bases even for all modules in SR. 
This finishes the proof. We remark, however, that one can conclude now by 
using Theorem 3.18 that 3(L) and SR are equal. QED 

The next corollary of the proof of Corollary 3.20 can be considered as a 
partial converse of Theorem 2.7. 

Corollary 3.21 Let R be a principal ideal domain on L, a finitely gen¬ 
erated free R-module. Let SR be a finitely generated (in the sense of Corollary 
3.20) lattice of submodules of L that have the same R-rank as L Then the 
maximal suborder H of End/^L) leaving the modules in SR invariant has the 
same R-rank as End R (L) and L is an absolutely irreducible H-representation 
module. (It might happen that SR c= 3(E).) 

The next application of the ideas developed in this chapter can already 
be found in Plesken [11]. 

Theorem 3.22 If $L/pL$ has two projective constituents each with 
multiplicity one, then there is a centering M of L such that the p-centerings in $\mathfrak{R}(M)$ 
are linearly ordered. M can be chosen to be any centering of L such that $M/pM$ 
is indecomposable. 

Proof The proof follows from Theorem 3.11 and Lemma 3.16. In the 
notation of Definition 3.15: 

\[ \mathfrak{Z}_p(L) \cap \mathfrak{R}(M) = \{M(0, 0), M(1, 0), \dots, M(\delta, 0)\} \quad \text{or} \quad \{M(0, 1), \dots, M(0, \delta)\}. \]

QED 


We finish this section with a complete list of distributive lattices of 
p-centerings of L, where $L/pL$ has three composition factors, and the critical 
p-exponent $\alpha$ of L is less than or equal to three. So we have the combinatorial 
task of finding all graduated orders as described in Theorem 3.18 with k = 3 
and $\alpha_{ij} + \alpha_{ji} \le 3$. Note that $\max_{i,j}(\alpha_{ij} + \alpha_{ji})$ is the critical p-exponent of L. If 
the critical p-exponent $\alpha$ is one, the centerings are linearly ordered. If $\alpha = 2$, 
there are four possibilities and for $\alpha = 3$ seven possibilities: The maximal 
order H leaving the centerings invariant can be transformed to 

\[ H(\alpha_{21}, \alpha_{23}, \alpha_{31}, \alpha_{32}) = \left\{ \begin{pmatrix} a_{11} & a_{12} & a_{13} \\ p^{\alpha_{21}} a_{21} & a_{22} & p^{\alpha_{23}} a_{23} \\ p^{\alpha_{31}} a_{31} & p^{\alpha_{32}} a_{32} & a_{33} \end{pmatrix} \;\middle|\; a_{ij} \in R^{n_i \times n_j} \right\} \]

with $(\alpha_{21}, \alpha_{23}, \alpha_{31}, \alpha_{32})$ equal to (1, 0, 2, 1), (1, 0, 2, 2), (2, 0, 2, 2), and 
(2, 1, 2, 1) for $\alpha = 2$, and (1, 0, 3, 2), (1, 0, 3, 3), (2, 0, 3, 2), (2, 0, 3, 3), (2, 1, 3, 2), 
(3, 1, 3, 2), and (3, 0, 3, 3) for $\alpha = 3$. The class numbers are 4, 5, 6, 7, 6, 7, 8, 9, 
8, 11, 10. The lattices are shown in Fig. 5. 
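
As a hedged illustration of the combinatorial task, the following sketch enumerates invariant sublattices by their exponent vectors. It assumes, following Theorem 3.18 and Lemma 3.16 (v), that the representatives correspond to vectors $(\alpha_1, \alpha_2, \alpha_3)$ with $\alpha_{ij} + \alpha_j \ge \alpha_i$ and minimum 0, and that the class number is their count. The function names are ours.

```python
from itertools import product


def exponent_matrix(a21, a23, a31, a32):
    """Exponent matrix of H(a21, a23, a31, a32); the first row is zero."""
    return [[0, 0, 0], [a21, 0, a23], [a31, a32, 0]]


def centerings(alpha):
    """Exponent vectors a with min(a) == 0 and alpha[i][j] + a[j] >= a[i]."""
    k = len(alpha)
    # No entry of a normalized vector can exceed the critical exponent.
    bound = max(alpha[i][j] + alpha[j][i] for i in range(k) for j in range(k))
    found = []
    for a in product(range(bound + 1), repeat=k):
        if min(a) != 0:
            continue
        if all(alpha[i][j] + a[j] >= a[i] for i in range(k) for j in range(k)):
            found.append(a)
    return found


def class_number(alpha):
    """Number of normalized invariant exponent vectors."""
    return len(centerings(alpha))
```

Under these assumptions the four tuples listed for critical exponent 2 give the counts 4, 5, 6, 7, in agreement with the first four class numbers quoted above.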


Fig. 5 Lattices of centerings for H(1, 0, 2, 1), H(1, 0, 2, 2), H(2, 0, 2, 2), 
H(2, 1, 2, 1), H(1, 0, 3, 2), H(1, 0, 3, 3), etc. 







4. 


We want to make some comments on the behavior of centerings if one 
replaces the Dedekind domain R by a Dedekind domain $R'$ containing R 
such that the quotient field $K'$ of $R'$ is a finite field extension of K. Let 
$H' = R' \otimes_R H$ and $L' = R' \otimes_R L$ for any H-representation module L. In 
Zassenhaus and Reiner [18] it is proved that $L'_1 \cong L'_2$ iff $L_1 \cong L_2$, in case R is 
local. If L is absolutely irreducible, a somewhat more precise description of 
the situation can be given. 

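Theorem 4.1 Let L be absolutely irreducible. 

(i) [18] The mapping $\mathfrak{Z}(L) \to \mathfrak{Z}(L')$: $M \mapsto R' \otimes M = M'$ is injective. 
$\mathfrak{R}(L)$ is mapped into $\mathfrak{R}(L')$. 

(ii) Let $\mathfrak{p}$ be a (critical) prime ideal of R and let the centerings of $\mathfrak{R}(L)$ 
all contain $\mathfrak{p}^{\alpha} L$. Let $\mathfrak{p}'$ be an unramified prime ideal of $R'$ dividing $\mathfrak{p}$. The 
mapping $\Phi$: $\mathfrak{Z}_{\mathfrak{p}}(L) \cap \mathfrak{R}(L) \to \mathfrak{Z}_{\mathfrak{p}'}(L')$: $M \mapsto M' + \mathfrak{p}'^{\alpha} L'$ is injective. Assume 
that the constituents of $L/\mathfrak{p}L$ are absolutely irreducible. $\Phi$ is a bijection if and 
only if $R/\mathfrak{p} \cong R'/\mathfrak{p}'$ or if the lattice $\mathfrak{Z}_{\mathfrak{p}}(L)$ of $\mathfrak{p}$-centerings of L is distributive. 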

Proof (i) Let $M, N \in \mathfrak{Z}(L)$ with $M' = N'$. Choose $r \in R$ such that 
$rM \subseteq N$ and denote the invariant factor ideals of $rM$ in N by $\mathfrak{A}_1, \dots, \mathfrak{A}_n$. 
Then $(rM)' = rM' \subseteq N'$ with invariant factor ideals $\mathfrak{A}_1 R', \dots, \mathfrak{A}_n R'$. Part (i) 
now follows easily. 

(ii) The injectivity of $\Phi$ follows from the proof of part (i) of this theorem 
and from Proposition 2.6. To prove the last statement we first note that the 
maximal $\mathfrak{p}'$-centerings correspond to certain one-dimensional subspaces of 
certain $R'/\mathfrak{p}'$-vector spaces of homomorphisms as described in Lemma 3.7. 
The maximal $\mathfrak{p}$-centerings of M correspond to the one-dimensional 
subspaces of the corresponding $R/\mathfrak{p}$-vector spaces from which the $R'/\mathfrak{p}'$-spaces 
under consideration can be obtained by tensoring with $R'/\mathfrak{p}'$. Now it is easy 
to see that the $R'/\mathfrak{p}'$-subspaces are in one-to-one correspondence with the 
$R/\mathfrak{p}$-subspaces iff $R/\mathfrak{p} = R'/\mathfrak{p}'$ or if all the spaces are one dimensional, i.e., 
the lattice of $\mathfrak{p}$-centerings is distributive. QED 

Corollary 4.2 If under the assumption of Theorem 4.1 $\mathfrak{p}$ is ramified with 
ramification index a, then the number of $\mathfrak{p}'$-centerings in $\mathfrak{R}(L')$ is greater than 
or equal to 

a · (number of $\mathfrak{p}$-centerings in $\mathfrak{R}(L)$ − 1) + 1 

and the critical $\mathfrak{p}'$-exponent of $L'$ is greater than or equal to 

a · (critical $\mathfrak{p}$-exponent of L). 

If $M \in \mathfrak{Z}_{\mathfrak{p}}(L) \cap \mathfrak{R}(L)$ such that no proper centering of M has this property, 
then $M'$ is also minimal in $\mathfrak{Z}_{\mathfrak{p}'}(L') \cap \mathfrak{R}(L')$. 

Proof The last statement follows immediately from Theorem 4.1 (ii) 
and the definition of $\mathfrak{R}(L)$. To prove the first result let $M, N \in \mathfrak{Z}_{\mathfrak{p}}(L)$ with 
$M \subset N$ maximal and let $R'\mathfrak{p} = (\mathfrak{p}')^{a}\,\mathfrak{U}$. Then 

\[ N'/M' \supset \mathfrak{p}'(N'/M') \supset \dots \supset (\mathfrak{p}')^{a}(N'/M') \]

where all inclusions are proper. The inverse images of these modules are 
different $\mathfrak{p}'$-centerings of $L'$. 

The result follows now from Proposition 2.6 and an easy counting 
argument. QED 

Distributivity of lattices of centerings is not always preserved under 
ramified extensions if the projective composition factors are not in 
one-to-one correspondence with the (usual) composition factors. For example, the 
dihedral group of eight elements has an irreducible Z-representation of 
degree 2. The lattice of 2-centerings is linearly ordered and the constituents 
of $L/2L$ are the 1-constituents. ($D_8$ is a 2-group!) By taking this 
representation over the gaussian integers $\mathbf{Z}[i]$, the lattice of $(1 + i)$-centerings becomes 
nondistributive (Fig. 6). By Corollary 3.8 there are three maximal 
$(1 + i)$-centerings of $M' + (1 + i)L'$. Hence $\mathfrak{Z}_{(1+i)}(L')$ is not distributive. 



Fig. 6 
5. 


We want to consider some examples of absolutely irreducible 
ZG-representations, where G is a finite group. 

Let G be a 2-transitive permutation group of $\Omega = \{1, \dots, n\}$. Let 
$I = \bigoplus_{i=1}^{n} \mathbf{Z} E_i$ be the representation module of the permutation 
representation, i.e., $gE_i = E_{g(i)}$ for $g \in G$ and $i \in \Omega$. The factor module $L = I/\mathbf{Z} \sum_{i=1}^{n} E_i$ 
is an absolutely irreducible representation module by Burnside’s lemma [8]. 
Denote the coset of $E_i$ by $e_i$. In case $G = S_n$, the constituents of the factor 
modules $L/pL$ are well known [6] for all primes p: 

(a) $L/pL$ is irreducible for $p \nmid n$. 

(b) $L/pL$ has two constituents for $p \mid n$. (One of the constituents is the 
1-constituent.) 

In [9] it is proved that (a) and (b) also hold for $G = A_n$ or certain of the 
Mathieu groups. 

Theorem 5.1 Let G be a 2-transitive permutation group such that (a) and 
(b) hold (for instance $G = S_n, A_n$). Then the set of representatives $\mathfrak{R}(L)$ 
consists of all centerings $L_d$ of L for $d \mid n$, where 

\[ L_d = \Big\{ \sum_{i=1}^{n} a_i e_i \;\Big|\; d \text{ divides } \sum_{i=1}^{n} a_i \Big\}. \]

The index of $L_d$ in L is d. Hence the class number of L is the number of 
divisors of n. 

Proof Because of (a) there are no p-centerings for $p \nmid n$. Let G act 
trivially on $\mathbf{Z}/n\mathbf{Z}$. Then $L_n = \ker \varphi$ with $\varphi: L \to \mathbf{Z}/n\mathbf{Z}$ via 
$\sum_{i=1}^{n} a_i e_i \mapsto \sum_{i=1}^{n} a_i$. $\varphi$ is well defined because of $\sum_{i=1}^{n} e_i = 0$ and is a 
ZG-homomorphism. For $d \mid n$, we get $L_d$ as the inverse image of $d \cdot (\mathbf{Z}/n\mathbf{Z})$ under 
$\varphi$. Hence the invariant factors of $L_d$ in L are 

\[ \underbrace{1, \dots, 1}_{n-2}, \; d. \]

So $L_d \in \mathfrak{R}(L)$. To show that there are no other centerings in $\mathfrak{R}(L)$ we make 
use of (b) and apply Theorem 3.22. We only have to show that $L/pL$ and 
(applying Proposition 2.6) $L_n/pL_n$ are indecomposable for $p \mid n$. So let 
$g \sum_{i=1}^{n} a_i e_i \equiv \sum_{i=1}^{n} a_i e_i \pmod{pL}$, hence 

\[ a_{g(1)} - a_1 \equiv \dots \equiv a_{g(n)} - a_n \pmod{p} \]

for all $g \in G$. By using the 2-transitivity of G, one gets 
$a_1 \equiv a_2 \equiv \dots \equiv a_n \pmod{p}$, hence $\sum_{i=1}^{n} a_i e_i \equiv 0 \pmod{pL}$, i.e., $L/pL$ is 
indecomposable. The indecomposability of $L_n/pL_n$ follows immediately from the 
observation that the modules L and $L_n$ are dual (via the quadratic form of L fixed by G). We 
indicate another proof: $L_n$ is generated by $e_i - e_j$ ($i, j = 1, \dots, n$, $i \ne j$). Let 
$\varphi: L_n \to \mathbf{Z}/p\mathbf{Z}$ be a ZG-homomorphism, where G acts trivially on $\mathbf{Z}/p\mathbf{Z}$. 
Because of the 2-transitivity of G all $e_i - e_j$ are mapped on the same element of 
$\mathbf{Z}/p\mathbf{Z}$. For odd p this implies immediately $\varphi = 0$ because 
$\varphi(e_i - e_j) = \varphi(e_j - e_i)$. If p = 2, one gets $3\varphi(e_1 - e_2) = \varphi(e_1 - e_2) = 0$ from 
$(e_1 - e_2) + (e_2 - e_3) + (e_3 - e_1) = 0$. Hence $\varphi = 0$ and $L_n/pL_n$ is also 
indecomposable. QED 
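
The counting in Theorem 5.1 can be illustrated with a few lines of Python. This is only a sketch with names of our own choosing: an element of L is represented by an integer coefficient list $(a_1, \dots, a_n)$ modulo multiples of $(1, \dots, 1)$, and the index of $L_d$ is computed as the order of $(\mathbf{Z}/n\mathbf{Z})/d(\mathbf{Z}/n\mathbf{Z})$.

```python
def divisors(n):
    """All positive divisors of n."""
    return [d for d in range(1, n + 1) if n % d == 0]


def phi(coeffs, n):
    """The map sum a_i e_i -> sum a_i in Z/nZ (well defined since sum e_i = 0)."""
    return sum(coeffs) % n


def index_of_L_d(n, d):
    """Index of L_d in L, computed as |Z/nZ| / |d * (Z/nZ)| for d dividing n."""
    subgroup = {(d * t) % n for t in range(n)}
    return n // len(subgroup)


def class_number(n):
    """Number of centerings L_d in the set of representatives, i.e. divisors of n."""
    return len(divisors(n))
```

For n = 12 this gives six centerings $L_1, L_2, L_3, L_4, L_6, L_{12}$, and adding $(1, \dots, 1)$ to a coefficient list does not change its image under `phi`.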

For $G = S_n$, Theorem 5.1 was already stated without proof in [7]. A 
proof similar to the one above can be found in [11], also for $G = S_n$. 

For our next example let G be an irreducible subgroup of GL(n, Z) 
containing all the diagonal matrices of GL(n, Z). It is clear that G is a 
semidirect product of the elementary abelian 2-group $D_n$ of all diagonal 
matrices of GL(n, Z) of order $2^n$ and a transitive group $P_n$ of permutation 
matrices of degree n. 

Theorem 5.2 Let $G = D_n P_n$ as described above and $L = \mathbf{Z}^{n \times 1}$. Then all 
centerings of $\mathfrak{R}(L)$ lie between L and 2L and are in one-to-one correspondence 
with the normal subgroups of G contained in $D_n$ and unequal 1. 

Proof Let $e_1, \dots, e_n$ be the standard basis of L and let $e_{ij}$ ($i, j = 1, \dots, n$) 
be the matrix units of $\mathbf{Z}^{n \times n}$ with $e_{ij} e_k = \delta_{jk} e_i$. We want to show that the $2e_{ij}$ are 
contained in the Z-enveloping algebra $\varphi(G)$ of G. Let $g \in P_n$ with $g e_i = e_j$. 
Then 

\[ g - \mathrm{diag}(1, \dots, 1, -1, 1, \dots, 1)\, g = 2e_{ji} \in \varphi(G), \]

with the −1 in the jth place. Hence the last invariant factor of $\varphi(G)$ in $\mathbf{Z}^{n \times n}$ is 2. By Theorem 2.7 all 
centerings of $\mathfrak{R}(L)$ lie between L and 2L. One easily sees now that $D_n$ and 
$L/2L$ are equivalent G-modules where G acts on $D_n$ by conjugation. QED 
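
The matrix identity used in the proof can be checked directly. The sketch below uses plain Python lists and names of our own; it assumes the convention that g is the permutation matrix with $g e_i = e_j$ and that D is the diagonal matrix with −1 in place j, so that g − Dg is twice the matrix unit sending $e_i$ to $e_j$.

```python
def perm_matrix(images):
    """Permutation matrix g with g e_i = e_{images[i]} (columns are the images)."""
    n = len(images)
    return [[1 if images[col] == row else 0 for col in range(n)]
            for row in range(n)]


def sign_flip(n, j):
    """Diagonal matrix with -1 in place j and 1 elsewhere."""
    return [[(-1 if r == j else 1) if r == c else 0 for c in range(n)]
            for r in range(n)]


def matmul(a, b):
    """Product of two square integer matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[r][t] * b[t][c] for t in range(n)) for c in range(n)]
            for r in range(n)]


def matrix_unit(n, row, col):
    """Matrix with a single 1 in position (row, col)."""
    return [[1 if (r, c) == (row, col) else 0 for c in range(n)]
            for r in range(n)]


def doubled_unit(images, i):
    """Return g - D g, where D flips the sign of coordinate j = images[i]."""
    n = len(images)
    g = perm_matrix(images)
    dg = matmul(sign_flip(n, images[i]), g)
    return [[g[r][c] - dg[r][c] for c in range(n)] for r in range(n)]
```

With a transitive group of permutation matrices every pair (i, j) is reached by some g, which is why all doubled matrix units lie in the enveloping algebra.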

Corollary 5.3 If $P_n$ in Theorem 5.2 is cyclic, the centerings of L in $\mathfrak{R}(L)$ 
are in one-to-one correspondence with the divisors $f \ne 1$ of $(x^n - 1)$ in $\mathbf{Z}/2\mathbf{Z}[x]$. 

Corollary 5.4 If $P_n$ in Theorem 5.2 satisfies the conditions (a) and (b) at 
the beginning of this section, then the class number is three. The centerings of 
$L = \mathbf{Z}^{n \times 1}$ in $\mathfrak{R}(L)$ are L, $N = \{\sum_{i=1}^{n} a_i e_i \mid 2 \text{ divides } \sum_{i=1}^{n} a_i\}$ and 
$M = \{\sum_{i=1}^{n} a_i e_i \mid a_i \equiv a_j \pmod 2 \text{ for } i, j = 1, \dots, n\}$. 

Note that for even n the 2-centerings of L in (5.4) are linearly ordered: 
$L \supset N \supset M \supset 2L \supset \cdots$. $L/N$ and $M/2L$ are 1-constituents. This is an example 
showing that projective constituents and ordinary constituents differ from each other. 

As a further example we mention that it is proved by Zassenhaus [15] 
that for absolutely irreducible representation modules L of metacyclic 
groups the lattice of p-centerings is linearly ordered and the p-centerings of 
$\mathfrak{R}(L)$ lie between L and pL. (It is proved that $\varphi_L(G)$ is a hereditary order.) 
As a last example we take the 2-group of 8 × 8 matrices given by 



where A = [? “£] and B = [° £]• Note that by Theorem 2.8 the only nontri¬ 
vial centerings of L = Z 8 * 1 are 2-centerings and that for p-groups there is 
only one modular irreducible representation. The centerings in this example 
were computed electronically by M. Pohst and myself by using the program 
written for Plesken and Pohst [12]. The lattice of 2-centerings is shown in 
Fig. 7. The class number is 32. But one can easily construct examples of 
2-groups of degree 8 with higher class numbers. 



I 

I 

Fig. 7 
The Existence of P-adic Abelian L-functions 

CARY QUEEN † 

UNIVERSITY OF CALIFORNIA 
BERKELEY, CALIFORNIA 


We first prove that a modular form on $\Gamma_0(p^r)$, $r \ge 1$, is a p-adic modular form. Using this fact, we 
establish the existence of a p-adic L-function for each abelian character of any totally real number 
field. 

Introduction 

It is a well-known but remarkable fact that the values of the Riemann 
zeta function at negative integers are rational. This phenomenon suggested 
to Kubota and Leopoldt the possibility of defining a “p-adic zeta function” 
on $\mathbf{Z}_p$ by p-adic interpolation between these values [6]. More generally, they 
proved that for every Dirichlet character $\chi$ there is a continuous p-adic 
L-function $L_p(\chi, s)$ on $\mathbf{Z}_p$ such that $L_p(\chi, 1 - n) = L(\chi, 1 - n)$ for positive 
integers $n \equiv 0 \bmod (p - 1)$. The significance of this result was enhanced by 
Iwasawa’s later discovery of the connection between p-adic L-functions and 
cyclotomic extensions [3]. 

It would naturally be of interest to extend the concept of p-adic 
L-functions to any algebraic number field. This might be possible for at least 
totally real number fields because then $L_K(\chi, 1 - n)$ is an algebraic number 
[5], not always zero. However, the project has been stymied by the lack of 
congruences for $L_K(\chi, 1 - n)$ analogous to those for $L_{\mathbf{Q}}(1, 1 - n) = -B_n/n$. 
Coates and Sinnott did recently treat real quadratic K using an explicit 
formula of Siegel [2]. 

† Present address: Department of Mathematics, Cornell University, Ithaca, New York. 

J.-P. Serre created a new approach to the question when he introduced 
the concept of “p-adic modular form” [10]. Siegel had earlier noticed that 
$L_K(\chi, 1 - n)$ is the constant term of a modular form. Serre’s $L_p(\chi, 1 - s)$ 
turns out to be the constant term of a p-adic modular form. However, 
because he relied on a theorem that modular forms on $\Gamma_0(p)$ are p-adic 
modular forms, Serre’s treatment was limited to characters of extensions 
$K(\zeta_p)/K$. 

The object of the present paper is to prove the existence of a p-adic 
L-function associated to any abelian character $\chi$ of any totally real number 
field. The basic tool in our proof is a theorem that any modular form on 
$\Gamma_0(p^r)$, $r \ge 1$, is a p-adic modular form.† In order to be fully general, 
everything must be done in the setting of p-adic modular forms of level N, a 
generalization of Serre’s original idea due to Katz [4]. 

In Section 1 some results of Katz are rapidly redone in the style of Serre 
in order to provide a convenient starting point for the present work. 

In Section 2 the idea of a p-adic modular quasiform is introduced. This 
technical notion is precisely what is needed to prove our basic theorems. The 
proofs in this section are just slight variations of proofs of Serre [9, 10], but 
the results seem genuinely stronger. 

In Section 3 we show directly that certain modular forms on a group 
“$\Gamma_{01}(p^2)$” are p-adic modular quasiforms. These forms are then used to show 
inductively that modular forms of level N on $\Gamma_0(p^r)$ are p-adic modular 
forms of level N. 

In Section 4 the previous results are extended to modular forms on $\Gamma_1(p^r)$ 
having characters with respect to $\Gamma_0(p^r)$. In particular, it is shown that the 
higher the conductor of the character, the more rigidly is the constant term 
determined. 

Finally, we present a proof of the existence of p-adic abelian L-functions 
and discuss their continuity at s = 1. 

I would like to thank J. Coates, N. Katz, J.-P. Serre, and especially my 
advisor A. Ogg for help and encouragement. 

1. Preliminaries 

For any prime p, let $\bar{\mathbf{Q}}_p$ denote an algebraic closure of $\mathbf{Q}_p$, so $\bar{\mathbf{Q}}_p$ is a 
compositum of the field of p-adic numbers $\mathbf{Q}_p$ and the field of algebraic 
numbers $\bar{\mathbf{Q}}$. Fix once and for all an extension $v_p$ of the p-adic valuation to 
$\bar{\mathbf{Q}}_p$, normalized so $v_p(p) = 1$. If K is an extension of $\mathbf{Q}$ in $\bar{\mathbf{Q}}$, let $\mathcal{O}_K$ be its ring 
of integers and $\tilde{K}$ its residue field. Should K be finite over $\mathbf{Q}$, write 
$K_p = K\mathbf{Q}_p$ for its closure in $\bar{\mathbf{Q}}_p$. 

† We have been informed that N. Katz has also obtained this result. 

Let G be any subgroup of $\Gamma = SL(2, \mathbf{Z})$ which contains $\begin{pmatrix} 1 & N \\ 0 & 1 \end{pmatrix}$, and put 
$q = e^{2\pi i z/N}$. A modular form f on G is said to be defined over K if the 
q-expansion of f at $i\infty$ is $\sum_{n=0}^{\infty} a_n q^n$ with each $a_n \in K$. If in fact f is defined 
over $\mathcal{O}_K$, i.e., each $a_n \in \mathcal{O}_K$, then $\tilde{f} = \sum \tilde{a}_n q^n \in \tilde{K}[[q]]$ is called the 
reduction of f. Denote by $\mathcal{M}_K(G, k)$ the vector space over K of (holomorphic) 
modular forms on G of weight k defined over K, and by $\tilde{\mathcal{M}}_K(G, k)$ the 
$\tilde{K}$-subspace of $\tilde{K}[[q]]$ consisting of the reductions of all forms in $\mathcal{M}_K(G, k)$ 
defined over $\mathcal{O}_K$. Finally, let $\mathcal{M}_K(G)$ be the subalgebra $\sum_k \mathcal{M}_K(G, k)$ of 
$K[[q]]$ and $\tilde{\mathcal{M}}_K(G)$ the corresponding subalgebra $\sum_k \tilde{\mathcal{M}}_K(G, k)$ of $\tilde{K}[[q]]$. 
For short, $\mathcal{M}_{\bar{\mathbf{Q}}}(G, k)$ will be abbreviated by $\mathcal{M}(G, k)$, $\tilde{\mathcal{M}}_{\bar{\mathbf{Q}}}(G)$ by $\tilde{\mathcal{M}}(G)$, and 
so on. We take as our starting point two fundamental theorems. 

Theorem A (Swinnerton-Dyer) If $p \ne 2, 3$, then 

\[ \tilde{\mathcal{M}}(\Gamma) \cong \bar{\mathbf{F}}_p[Q, R]/(A - 1) \]

where Q and R are indeterminates and $A(Q, R)$ is the polynomial such that 
$\tilde{E}_{p-1} = A(\tilde{E}_4, \tilde{E}_6)$. If p = 2 or 3, then $\tilde{\mathcal{M}}(\Gamma) \cong \bar{\mathbf{F}}_p[\Delta]$ with $\Delta$ algebraically 
independent over $\bar{\mathbf{F}}_p$. 

We write $v_p(\sum_{n=0}^{\infty} a_n q^n)$ for $\inf_{n \ge 0} v_p(a_n)$ and $\{x\}$ for the least integer greater 
than or equal to x. 

Theorem B (Serre, Katz) Let $f^{(1)} = \sum_{n=0}^{\infty} a_n^{(1)} q^n$ and $f^{(2)} = \sum_{n=0}^{\infty} a_n^{(2)} q^n$ 
be members of $\mathcal{M}(\Gamma(N), k_1)$ and $\mathcal{M}(\Gamma(N), k_2)$ respectively. Suppose $r \in \mathbf{Q}$, 
$(p, N) = 1$, and $v_p(f^{(1)} - f^{(2)}) \ge v_p(f^{(1)}) + r$. Then: 

(1) if $p \ne 2$ and $r > 0$, $k_1 \equiv k_2 \bmod (p - 1)p^{\{r\} - 1}$; 

(2) if $p = 2$ and $r > 1$, $k_1 \equiv k_2 \bmod 2^{\{r\} - 2}$. 

The proof of Theorem A is the same, mutatis mutandis, as the proof of 
Theorem 2 in [12], with $\bar{\mathbf{Q}}$ replacing $\mathbf{Q}$ and $\bar{\mathbf{F}}_p$ replacing $\mathbf{F}_p$. 

To prove Theorem B, we may assume that v p (f {1) ) = 0 after multiplying 
/ (1) and / (2) by an appropriate constant. Suppose / (1) and/ (2) are defined 
over a number field L with maximal unramified subfield K, so K = L and 
(V Kp is the Witt ring W(K). Let w 1? ..., w„ be an integral basis for L p over K p , 
i.e., a basis for the module (9 Lp over the ring 0 Kp . Since Jt(T(N ), k) always 
has a basis of forms defined over Q, there are K p -linear combinations ff of 
modular forms over Q such that / (1) = Yj w jf ( / ) an d / (2) = Yj w jf ( j 2) - 
Choose WjEK p such thatw=w,-, so for each j, v p (wj — Wj)> 0 and 
— /i' 2) ) — M* As an eas y consequence, if f {i) = Yj ™jfT> th en 
v p (f {1) -f (2) )> [r] for [r] = r and v p (f {1) -f i2) )>[r] for [r] / r. Since 
f {1) —f (2) is defined over the unramified field K p , in either case 
v P (f a) —/ (2) ) > M- The theorem now follows by applying Corollary 4.4.2 of 
[4] to/ (I) e S(W(K), 1, N, kf 

In all that follows, by a limit of a sequence of series f {i) = a n )( f > 

a ( f e Q p , we shall mean a series / = such that lim^^ a :J,° = 

uniformly for all n. Moreover, p will always denote a prime not dividing a 
fixed positive integer N, and K will denote some finite extension of Q. By a 
p-adic modular form f over K of level N, we mean a limit of a sequence 
f (l) e Jt K {T(N\ ki). It follows immediately from Theorem B that the k t 
approach a limit k in X = Z p x Z/(p — 1)Z, which is independent of the 
particular sequence of modular forms f (i) approaching / and is called the 
weight of/. The set of p-adic modular forms over K of level N and weight k is 
a K p -vector space Ji K (T(A/, k) closed under limits. Moreover, the p-adic 
modular forms of level N and weight k over all finite extensions of Q consti¬ 
tute a vector space over Q p , denoted <Jf(r(N), k). Any member of 

Jt{T(N\ k) can be called a p-adic modular form of level N, without regard 
to its particular field of definition, and p-adic modular forms of level 1 can 
simply be called p-adic modular forms. 

Two important corollaries may be deduced from Theorem B. 

Corollary C Let f= X„°°=o a n <f e ), k) with k = (t, s) e Z p x 

Z!(p — 1)Z. Then: 

(1) if s ± 0, v p (a 0 ) > inf„> t a n ; 

(2) if s = 0, v p (a 0 ) > inf n£ 1 a„ - v p (t) - 1 - <5 2 , 

Corollary D Let / (i > = ^=o a ( fq" e J/ K (r(N), k t ); i= l, 2, .... Sup¬ 
pose that the af tend uniformly toward a„ € Q p for n > 1, and the k t approach a 
nonzero limit k in X. Then the a'd’ have a limit a 0 eQ p , and 

f=Z? =0 a n q"Ejt K (r(N),k). 

Corollary C follows directly from Theorem B upon approximating/by 
/ (1) g Jt(r(N), f), letting / (2) — a 0 e 0), and observing that 

v P {f {1) —f (2) ) = i v p (a n ). For the proof of Corollary D, first recall that 
the Eisenstein series E k == 1 mod p m if k = 0 mod(p — 1 )p m ~ 1 (p 2) and 
E k = 1 mod 2 m if k = 0 mod 2 m “ 2 . Since K is a finite extension of Q, K p is 
complete. To show that the atf converge, it therefore suffices to show that 
they form a Cauchy sequence. Since lim,^ fc, = /c = (t, s) / 0, either s f 0 
and ^ f 0 mod(p — 1) for i large enough, or t f 0 and v p (ki) is bounded. By 
Corollary C, the v p (a { ^) are thus bounded, and upon multiplying all the f {i) 
by a constant, one may assume that v p (f (l) ) > 0 for each i. By assumption, if 
z'i and i 2 are large enough, inf„> x v p (a ( ^ 2) — a^ x) ) > M and 
k i2 = k it mod(p — l)p M “ l for given M. If k i2 > k ix , say, then Corollary C 
applied to f ii2) — £ kii _ k . 2 f ih) e ^(T(N), k i2 ) yields v p (a { ^ 2) — a ( o°) > M if 
k i2 ^ 0 mod(p — 1) and v p (a { o 2) — a { o l) ) > M — v p (k i2 ) — 2 if k i2 = 0, as 
desired. Finally,/e Jfc K (T(N), k) by the definition of p-adic modular forms 
over K of level N and weight k. 
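
The Eisenstein congruences recalled in the proof of Corollary D can be spot-checked numerically. The sketch below is not part of the argument; it assumes the classical normalization $E_k = 1 - (2k/B_k) \sum_{n \ge 1} \sigma_{k-1}(n) q^n$, so that $E_k \equiv 1 \bmod p^m$ amounts to $v_p(2k/B_k) \ge m$. The helper names are ours.

```python
from fractions import Fraction
from math import comb


def bernoulli(n):
    """Bernoulli number B_n (with B_1 = -1/2), via the standard recursion."""
    b = [Fraction(1)]
    for m in range(1, n + 1):
        b.append(-sum(comb(m + 1, j) * b[j] for j in range(m)) / (m + 1))
    return b[n]


def vp(x, p):
    """p-adic valuation of a nonzero rational number x."""
    x = Fraction(x)
    if x == 0:
        raise ValueError("valuation of 0 is infinite")
    num, den, v = x.numerator, x.denominator, 0
    while num % p == 0:
        num //= p
        v += 1
    while den % p == 0:
        den //= p
        v -= 1
    return v


def eisenstein_factor(k):
    """The factor c in E_k = 1 + c * sum sigma_{k-1}(n) q^n, i.e. c = -2k/B_k."""
    return -2 * k / bernoulli(k)
```

For instance the factor is 240 for k = 4 and −504 for k = 6, and for p = 5 and k = 20 = (p − 1)p the valuation of the factor is at least 2.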

Our first two propositions show that no essentially new reduced or 
p-adic modular forms arise when the field of definition is extended. Con¬ 
sequently, a reduced or p-adic modular form is defined over K precisely 
when its coefficients are all contained in K or K p . 

Proposition 1 Jfc K (r(N), k) = M Q (T(iV), k) (x) Q K. 

Proof Let w l5 ..., w„ be an integral basis for K p over Q p . If 
/g Jt K {T{N\ fe), let / e k) have reduction / Then / = JL Wj f } 

with fj g ^ Kp [[^]] a Q p -linear combination of members of ^ Q (T(iV), k). 
Since each /• may be approximated by a member of JV Q (r(N), k ), 
fj e M Q (r(N), k) and/= Yj Wjfj e «% Q {r{N), k) ® Q K. 

Corollary ///= ^°° =0 e Jt{T(N), fe), then fe j( K (T{N), k) if and 
only if each a n e K. 

Proof Only the “if’ assertion requires proof. By the proposition, 
/ — Yjj c j?j fj g Jf Q {T(N\ k) linearly independent and Cj e F^. Since/ 
and the / have all coefficients in K , the Cj are also contained in K , say by 
Cramer’s rule, so fe Jf K (F(N), k) as desired. 

Proposition 2,£ K {T(N\ k) = Jf Q (T(N), k) ® Qp K p = J/ Q (T(N) y k) ® Q K. 

Proof It is enough to show that if/= lim^^ / (i) with f (i) e Jt K (T(N), fe,), 
then / is a K p -linear combination of members of J^ Q (r(iV), fe). Let w 1? ..., w„ 
be a basis for K p over Q p and write/ = JL fjWj,f {i) = ]T ff w j 9 where the/■ 
and // have coefficients in Q p . The// are Q p -linear combinations of modu¬ 
lar forms over Q, hence elements of ^q(T(N), fe,). Recall that the norm 
defined by || oljWj || = |a 7 | p is equivalent to the tynorm on K p . Thus 

Y.- f { pWj-> Y,- //Wy forces A 0 to approach /.• for each L that is, 
/• G ^ Q (T(iV), fe) and / = X; / w, e ^ Q (T(iV), fe) ® X p . 

Corollary 1 // /= ^=0 a n q n e fe), then fe Jt K (T(N\ fe) */ 

and only if each a n e K. 

Corollary 2 A p-adic modular form with coefficients in Q p is a p-adic 
modular form in the sense of Serre. 

Proof Corollary 1 is proved in the same manner as the corollary to 
Proposition 2, and Corollary 2 is of course a special case of Corollary 1. 

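Consider the linear operators $U, V: \bar{\mathbf{Q}}_p[[q]] \to \bar{\mathbf{Q}}_p[[q]]$ defined by 

\[ \Big(\sum_{n=0}^{\infty} a_n q^n\Big) \Big|\, U = \sum_{n=0}^{\infty} a_{pn} q^n \quad \text{and} \quad \Big(\sum_{n=0}^{\infty} a_n q^n\Big) \Big|\, V = \sum_{n=0}^{\infty} a_n q^{pn}. \]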
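
On truncated q-expansions the operators U and V are simple index manipulations. The following sketch, with function names of our own, represents a series by the list of its first coefficients $[a_0, a_1, \dots]$; it is meant only to make the definitions concrete.

```python
def U(coeffs, p):
    """(sum a_n q^n) | U = sum a_{pn} q^n: keep every p-th coefficient."""
    return coeffs[::p]


def V(coeffs, p):
    """(sum a_n q^n) | V = sum a_n q^{pn}: spread the coefficients out by p."""
    out = [0] * ((len(coeffs) - 1) * p + 1) if coeffs else []
    for n, a in enumerate(coeffs):
        out[p * n] = a
    return out
```

Applying V and then U returns the original list, reflecting the fact that V followed by U is the identity on power series.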
Proposition 3 ///= o a n q n e Jt K (Y(N\ k), then 
f\UJ\VeJ( K {T(N\ k). 

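Proof By Corollary 2.1, it suffices to show that $U$ and $V$ map the space of p-adic modular forms
$\mathcal{M}(\Gamma(N), k)$ into itself. Since $U$ and $V$ commute with limits, this would
follow if $U$ and $V$ mapped classical modular forms on $\Gamma(N)$ of weight $k$ to p-adic modular forms of weight $k$ for $k \in \mathbf{Z}$. There is a
decomposition

$$\mathcal{M}(\Gamma(N), k) = \bigoplus_{\chi \in (\mathbf{Z}/N\mathbf{Z})^{\times *}} \mathcal{M}(\Gamma(N), \chi, k)$$

where

$$\mathcal{M}(\Gamma(N), \chi, k) = \Bigl\{ f \in \mathcal{M}(\Gamma(N), k) \Bigm| f|_k \begin{pmatrix} a & b \\ c & d \end{pmatrix} = \chi(d) f \ \text{for}\ \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in \Gamma,\ b \equiv c \equiv 0 \bmod N \Bigr\}.$$

Hence, all one needs is that $f|U$ and $f|V$ are p-adic modular forms of level $N$ and weight $k$ for $f \in \mathcal{M}(\Gamma(N), \chi, k)$. Write
$k_m = (p - 1)p^{m-1}$ and $f_m = E_{k_m} f \in \mathcal{M}(\Gamma(N), k + k_m)$, so $k_m \to 0$ and $f_m \to f$
as $m \to \infty$. By the theory of Hecke operators, $f_m|T_p = f_m|U +
\chi(p)p^{k + k_m - 1} f_m|V \in \mathcal{M}(\Gamma(N), k + k_m)$. Therefore

$$f|U = \lim f_m|U = \lim f_m|T_p \in \mathcal{M}(\Gamma(N), k),$$

completing the proof for $U$. In particular, $f_m|V = \chi(p)^{-1} p^{1 - k_m - k}(f_m|T_p -
f_m|U)$ is a p-adic modular form of weight $k + k_m$, and $f|V = \lim f_m|V \in \mathcal{M}(\Gamma(N), k)$, completing
the proof for $V$.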


In [10], Serre shows that general Hecke operators $T_l$ and differential
operators $\theta$ and $R_h$, as well as $U$ and $V$, preserve p-adic modular forms of
level 1. As Serre has pointed out to us, $\theta$ and $R_h$ also preserve p-adic
modular forms of level $N$. For it follows from the transformation formula for
$E_2$ that if $f$ is a modular form on any subgroup of $\Gamma$, in particular $\Gamma(N)$, then
so is $\theta f - (\mathrm{wt}\, f/12)E_2 f$, while $E_2$ is itself a p-adic modular form of weight 2. The situation for the $T_l$ is more
complicated, however.


2. p-adic Modular Quasiforms 

For later purposes, a version of Corollary D will be needed which applies 
to certain sums of p-adic modular forms. As a first step in this direction, we 
define the relevant sums precisely. 

Definition A series $f = \sum_{n=0}^{\infty} a_n q^n$ is a p-adic modular quasiform over
$K$ of level $N$ and weight $k \in \mathbf{Z}_p$ if $f = \sum_{\alpha=1}^{p-1} f_\alpha$ with $f_\alpha \in \mathcal{M}_K(\Gamma(N), (k, \alpha))$.
Again, a p-adic modular quasiform of level 1 may be called a p-adic modular
quasiform.

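The ring of modular forms $\mathcal{M}(\Gamma)$ of course has a $\mathbf{Z}$-grading by weight.
Similarly, the ring of reduced modular forms $\widetilde{\mathcal{M}}(\Gamma)$ has a
$\mathbf{Z}/(p - 1)\mathbf{Z}$-grading if $p \neq 2, 3$, because by Theorem A, $\widetilde{\mathcal{M}}(\Gamma) \cong
\mathbf{F}_p[Q, R]/(\tilde{A} - 1)$, and $E_{p-1} = A(E_4, E_6)$ has weight $p - 1$. To put it another
way,

$$\widetilde{\mathcal{M}}(\Gamma) = \bigoplus_{\alpha=1}^{p-1} \widetilde{\mathcal{M}}^{\alpha}(\Gamma) \quad\text{with}\quad \widetilde{\mathcal{M}}^{\alpha}(\Gamma) = \sum_{k \equiv \alpha\ (p-1)} \widetilde{\mathcal{M}}(\Gamma, k).$$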

Proposition 4 If $f = \sum_\alpha f_\alpha$ is a p-adic modular quasiform,

$$v_p(f) = \inf_\alpha v_p(f_\alpha).$$

Proof Observe that since $\mathcal{M}(\Gamma, k) = 0$ if $2 \nmid k$, only the $f_\alpha$ with $\alpha$ even
can be nonzero. In particular, when $p = 2$ or 3, a p-adic modular quasiform
is simply a p-adic modular form, and the proposition is trivial. Otherwise, let
$\pi$ be a uniformizing element of a field of definition $K$ for $f$. Clearly $v_p(f) \geq
\inf_\alpha v_p(f_\alpha)$. After multiplying $f$ by an appropriate power of $\pi$, the proof of the
reverse inequality amounts to showing that if each $v_p(f_\alpha) \geq 0$, and if $v_p(f) \geq
l\,v_p(\pi)$ for a nonnegative integer $l$, then each $v_p(f_\alpha) \geq l\,v_p(\pi)$. This statement is
trivially true if $l = 0$, so to prove it by induction on $l$, assume it is true for
$l = m$. Since the $\widetilde{\mathcal{M}}^{\alpha}(\Gamma)$ are linearly independent, one sees after approximating
each $f_\alpha$ by a modular form that $v_p(f) \geq (m + 1)v_p(\pi) > 0$ implies each
$v_p(f_\alpha) > 0$. Hence $\pi^{-1} f = \sum_\alpha \pi^{-1} f_\alpha$ is a quasiform with $v_p(\pi^{-1} f) \geq m\,v_p(\pi)$
and $v_p(\pi^{-1} f_\alpha) \geq 0$, so by the inductive hypothesis $v_p(\pi^{-1} f_\alpha) \geq m\,v_p(\pi)$, i.e.,
$v_p(f_\alpha) \geq (m + 1)v_p(\pi)$ for each $\alpha$.

Corollary The representation of a p-adic modular quasiform $f$ as a sum
$\sum_\alpha f_\alpha$, $f_\alpha \in \mathcal{M}(\Gamma, (k, \alpha))$, is unique.

Proof It suffices to show that if $0 = \sum_\alpha f_\alpha$, then each $f_\alpha = 0$, and this is
the special case $v_p(f) = \infty$ of the proposition.

Theorem A assures the validity for $\widetilde{\mathcal{M}}(\Gamma)$ of Serre’s theory of filtration
and the $\theta$-operator [9]. In particular, if $p \neq 2, 3$, the filtration of $\theta^{p-2}(\tilde{E}_{p+1})$
is $p^2 - 1$. For the rest of this section, $q$ denotes $e^{2\pi i z}$.

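Proposition 5 If $k \in \mathbf{N}$ and

$(p - 1) \mid k$ when $p \neq 2$, or $2 \mid k$ when $p = 2$,

then

$$\tilde{\varphi} = \sum_{n=1}^{\infty} \sigma_{k-1}(n) q^n \notin \widetilde{\mathcal{M}}(\Gamma).$$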
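Proof By elementary congruences $\tilde{\varphi}^p - \tilde{\varphi} = \tilde{\psi}$ where

$$\psi = \tfrac{1}{24}\,\theta^{p-2}(E_{p+1})$$

if $p \neq 2, 3$ and $\psi = -\Delta$ if $p = 2$ or 3. The equation $\tilde{\varphi}^p - \tilde{\varphi} = -\tilde{\Delta}$ can clearly
not be solved in $\widetilde{\mathcal{M}}(\Gamma) = \mathbf{F}_p[\tilde{\Delta}]$, proving the proposition if $p = 2$ or 3. Otherwise,
assume $\tilde{\varphi} = \sum_\alpha \varphi_\alpha$ with $\varphi_\alpha \in \widetilde{\mathcal{M}}^{\alpha}(\Gamma)$. As $p\alpha \equiv \alpha \bmod (p - 1)$,
$\tilde{\psi} = \tilde{\varphi}^p - \tilde{\varphi} = \sum_\alpha (\varphi_\alpha^p - \varphi_\alpha)$ with $\varphi_\alpha^p - \varphi_\alpha \in \widetilde{\mathcal{M}}^{\alpha}(\Gamma)$. Since $\tilde{\psi}$ has filtration
$p^2 - 1$, $\varphi_\alpha^p - \varphi_\alpha = 0$ for $\alpha \neq p - 1$ and $\varphi_{p-1}^p - \varphi_{p-1} = \tilde{\psi}$ for $\alpha = p - 1$. But if
$\varphi_{p-1}$ has any filtration $h$, $\varphi_{p-1}^p$ and $\varphi_{p-1}^p - \varphi_{p-1}$ have filtration $ph$, which
cannot equal $p^2 - 1$, a contradiction.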
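The "elementary congruence" behind this proof can be checked numerically. The sketch below assumes $p \geq 5$ and $k = p - 1$, and assumes the classical normalization $E_{p+1} = 1 - \frac{2(p+1)}{B_{p+1}} \sum \sigma_p(n) q^n$ together with the constant $1/24$; these normalizations are assumptions of the sketch, and all function names are hypothetical. It compares, coefficient by coefficient modulo $p$, the series $\varphi^p - \varphi$ (using $\varphi^p \equiv \varphi|V$) with $\frac{1}{24}\theta^{p-2}(E_{p+1})$.

```python
from fractions import Fraction
from math import comb


def bernoulli_numbers(n_max):
    """Return [B_0, ..., B_{n_max}] as exact fractions (with B_1 = -1/2)."""
    B = [Fraction(0)] * (n_max + 1)
    B[0] = Fraction(1)
    for m in range(1, n_max + 1):
        B[m] = -sum(comb(m + 1, j) * B[j] for j in range(m)) / (m + 1)
    return B


def sigma(n, e):
    """Sum of the e-th powers of the positive divisors of n."""
    return sum(d ** e for d in range(1, n + 1) if n % d == 0)


def check_congruence(p, n_max):
    """Check phi^p - phi == (1/24) theta^{p-2}(E_{p+1}) mod p up to q^{n_max}.

    Here phi = sum sigma_{p-2}(n) q^n, i.e. the case k = p - 1.
    """
    B = bernoulli_numbers(p + 1)
    # Coefficient multiplying sum n^{p-2} sigma_p(n) q^n in (1/24) theta^{p-2} E_{p+1}.
    c = Fraction(-2 * (p + 1), 24) / B[p + 1]
    c_mod = c.numerator * pow(c.denominator, -1, p) % p
    for n in range(1, n_max + 1):
        # Coefficient of q^n in phi|V - phi.
        lhs = (sigma(n // p, p - 2) if n % p == 0 else 0) - sigma(n, p - 2)
        rhs = c_mod * pow(n, p - 2, p) * sigma(n, p)
        if (lhs - rhs) % p != 0:
            return False
    return True
```

A call such as `check_congruence(5, 60)` runs the comparison for the first 60 coefficients. This is only a finite numerical check, not a proof.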

It is now possible to establish an analogue of Theorem B for p-adic
modular quasiforms. The proof utilizes the Eisenstein-Serre series
$E_k^* \in \mathcal{M}_{\mathbf{Q}}(\Gamma, k)$ defined in [10], which have the property that
$v_p(E^*_{(h,0)} - 1) = v_p(h) + 1$.

Theorem 6 Suppose $f^{(1)} = \sum_\alpha f_\alpha^{(1)}$ and $f^{(2)} = \sum_\alpha f_\alpha^{(2)}$ are p-adic
modular quasiforms of respective weights $k_1$ and $k_2$, with $v_p(f^{(1)} - f^{(2)}) \geq
v_p(f^{(1)}) + r$, $r \geq 0$. Then $v_p(k_1 - k_2) \geq r - 1$ if $p \neq 2$ and $v_p(k_1 - k_2) \geq r - 2$
if $p = 2$.

Proof Assume that $K$ is a common field of definition for $f^{(1)}$ and $f^{(2)}$
with uniformizer $\pi$. If $p = 2$ or 3, $f^{(1)}$ and $f^{(2)}$ are actually p-adic modular
forms so, by Theorem B, nothing remains to be done. If $p \neq 2, 3$, after
multiplying $f^{(1)}$ and $f^{(2)}$ by an appropriate power of $\pi$, one may assume
$v_p(f^{(1)}) = 0$, whence each $v_p(f_\alpha^{(1)}), v_p(f_\alpha^{(2)}) \geq 0$ by Proposition 4. If $k_2 \neq k_1$,
let $h = k_2 - k_1$ and $m = v_p(h) + 1$, so the assertion is $m \geq r$. If this were
not the case, i.e., $r > m$, then the inequalities $v_p(f^{(1)} - f^{(2)}) \geq r$ and
$v_p(E^*_{(h,0)} - 1) \geq m$ applied to the equation $f^{(1)}E^*_{(h,0)} - f^{(2)} = (f^{(1)} - f^{(2)}) +
f^{(1)}(E^*_{(h,0)} - 1)$ give first $v_p(f^{(1)}E^*_{(h,0)} - f^{(2)}) \geq m$ and then

$$p^{-m}(f^{(1)}E^*_{(h,0)} - f^{(2)}) \equiv p^{-m} f^{(1)}(E^*_{(h,0)} - 1) \bmod \pi. \qquad (1)$$

Applying Proposition 4 to the p-adic modular quasiform $f^{(1)}E^*_{(h,0)} - f^{(2)}$ of
weight $k_2$ yields $p^{-m}(f^{(1)}E^*_{(h,0)} - f^{(2)}) = \sum_\alpha g_\alpha$, $g_\alpha \in \mathcal{M}(\Gamma)$, $v_p(g_\alpha) \geq 0$. The
equation (1) may now be rewritten

$$\sum_\alpha g_\alpha \equiv \sum_\alpha f_\alpha^{(1)}\, p^{-m}(E^*_{(h,0)} - 1) \bmod \pi. \qquad (2)$$

Since the reduction of a p-adic modular form obviously equals the reduction
of some modular form, and since

$$\tilde{\varphi} = \lambda\,\bigl(p^{-m}(E^*_{(h,0)} - 1)\bigr)^{\sim} \quad\text{with}\quad v_p(\lambda) = 0,$$

(2) shows that $\tilde{\varphi} = \lambda \sum_\alpha \tilde{g}_\alpha / \sum_\alpha \tilde{f}_\alpha^{(1)}$ is in the quotient field of $\widetilde{\mathcal{M}}(\Gamma)$. But by
Proposition 5 and its proof, $\tilde{\varphi}^p - \tilde{\varphi} = \tilde{\psi} \in \widetilde{\mathcal{M}}(\Gamma)$ and $\tilde{\varphi} \notin \widetilde{\mathcal{M}}(\Gamma)$, while
according to Section 1.2 of [10], $\widetilde{\mathcal{M}}(\Gamma)$ is integrally closed. This contradiction
to the assumption $m < r$ establishes the theorem.

Corollary 1 If $f^{(1)} = \sum_\alpha f_\alpha^{(1)}$ and $f^{(2)} = \sum_\alpha f_\alpha^{(2)}$ are p-adic modular
quasiforms and $m \in \mathbf{N}$, then $v_p(f^{(1)} - f^{(2)}) \geq m$ implies $v_p(f_\alpha^{(1)} - f_\alpha^{(2)}) \geq m$ for each
$\alpha$.

Proof Again, the corollary is obvious if $p = 2$ or 3, and one may assume
$v_p(f^{(1)}) = 0$ otherwise. If $f^{(1)}$ and $f^{(2)}$ have weights $k_1$ and $k_2$, then Theorem
6 gives $h = k_2 - k_1 \equiv 0 \bmod p^{m-1}$, so $f^{(1)}E^*_{(h,0)} - f^{(2)} \equiv f^{(1)} - f^{(2)} \equiv
0 \bmod p^m$. Since $f^{(1)}E^*_{(h,0)} - f^{(2)} = \sum_\alpha (f_\alpha^{(1)}E^*_{(h,0)} - f_\alpha^{(2)})$ is a p-adic modular
quasiform of weight $k_2$, Proposition 4 now gives $f_\alpha^{(1)} - f_\alpha^{(2)} \equiv f_\alpha^{(1)}E^*_{(h,0)} -
f_\alpha^{(2)} \equiv 0 \bmod p^m$.

Corollary 2 The limit of a sequence of p-adic modular quasiforms
$f^{(i)} = \sum_\alpha f_\alpha^{(i)}$ over $K$ of weight $k_i$ is a p-adic modular quasiform $f$ over $K$ of
weight $k = \lim k_i$.

Proof For each $\alpha$, the $f_\alpha^{(i)}$ converge to some $f_\alpha \in K_p[[q]]$ by Corollary 1
and the completeness of $K_p$. Since each $f_\alpha \in \mathcal{M}_K(\Gamma, (k, \alpha))$, $f = \sum_\alpha f_\alpha$ is a
p-adic modular quasiform of weight $k$.

Corollary 3 If $f = \sum_{n=0}^{\infty} a_n q^n$ is a p-adic modular quasiform of weight $k$,
then $v_p(a_0) \geq \inf_{n \geq 1} v_p(a_n) - v_p(k) - 1 - \delta_{2,p}$.

Corollary 4 Let $f^{(i)} = \sum_{n=0}^{\infty} a_n^{(i)} q^n$ be p-adic modular quasiforms over $K$ of
weight $k_i$, $i = 1, 2, \ldots$. Suppose that the $a_n^{(i)}$ tend uniformly toward some $a_n$ for
$n \geq 1$, and that the $k_i$ approach a nonzero limit $k$ in $\mathbf{Z}_p$. Then the $a_0^{(i)}$ have a
limit $a_0$, and $f = \sum_{n=0}^{\infty} a_n q^n$ is a p-adic modular quasiform over $K$ of weight $k$.

Proof The proofs of Corollaries 3 and 4 are analogous to those of 
Corollaries C and D, with Theorem 6 replacing Theorem B. 

We note that Theorem 6 can be deduced immediately from its Corollary 
1. However, there is no obvious way of proving Corollary 1 directly. It does 
seem necessary to use the same methods Serre used to prove Theorem B 
once again. 

3. Modular Forms on $\Gamma_{01}(p^r, N)$

Throughout this section, we fix a positive integer $N$, let $p$ be a prime not
dividing $N$, and write $(a, b : c, d)$ for the matrix $\begin{pmatrix} a & b \\ c & d \end{pmatrix}$. For each $r \in \mathbf{N}$, we
define three congruence subgroups of $\Gamma$ by

$\Gamma_0(p^r, N) = \{(a, b : c, d) \in \Gamma(N) \mid c \equiv 0 \bmod p^r\}$,

$\Gamma_{01}(p^r, N) = \{(a, b : c, d) \in \Gamma(N) \mid c \equiv 0 \bmod p^r,\ a \equiv d \equiv 1 \bmod p\}$,

$\Gamma_1(p^r, N) = \{(a, b : c, d) \in \Gamma(N) \mid c \equiv 0 \bmod p^r,\ a \equiv d \equiv 1 \bmod p^r\}$.

We abbreviate $\Gamma_0(p^r) = \Gamma_0(p^r, 1)$, $\Gamma_{01}(p^r) = \Gamma_{01}(p^r, 1)$, and $\Gamma_1(p^r) =
\Gamma_1(p^r, 1)$. There are isomorphisms $\Gamma_{01}(p^{r-1}, N)/\Gamma_{01}(p^r, N) \to \mathbf{Z}/p\mathbf{Z}$,
$(a, b : p^{r-1}c, d) \mapsto c$, and $\Gamma_0(p^r, N)/\Gamma_{01}(p^r, N) \to (\mathbf{Z}/p\mathbf{Z})^\times$, $(a, b : c, d) \mapsto d$,
where any integer is considered a member of $\mathbf{Z}/p\mathbf{Z}$ by the natural map. Since
$\mathbf{Z}/p\mathbf{Z}$ and $(\mathbf{Z}/p\mathbf{Z})^\times$ are abelian, there are two incompatible decompositions
of the space of modular forms on $\Gamma_{01}(p^r, N)$: one according to characters
from $\Gamma_{01}(p^{r-1}, N)$, and the other according to characters from $\Gamma_0(p^r, N)$.
Henceforward, $\xi$ will always denote a character of $\mathbf{Z}/p\mathbf{Z}$, extendible to an
additive character of $\mathbf{Z}$, and $\chi$ will denote a character of $(\mathbf{Z}/p\mathbf{Z})^\times$, extendible
to a Dirichlet character of modulus $p$. Then

$$\mathcal{M}(\Gamma_{01}(p^r, N), k) = \bigoplus_{\xi \in (\mathbf{Z}/p\mathbf{Z})^*} \mathcal{M}(\Gamma_{01}(p^r, N), \xi, k) \quad (r \geq 2)$$

$$\mathcal{M}(\Gamma_{01}(p^r, N), k) = \bigoplus_{\chi \in (\mathbf{Z}/p\mathbf{Z})^{\times *}} \mathcal{M}(\Gamma_{01}(p^r, N), \chi, k) \quad (r \geq 1).$$

Here $\mathcal{M}(\Gamma_{01}(p^r, N), \xi, k) = \{f \in \mathcal{M}(\Gamma_{01}(p^r, N), k)$ s.t. $f|_k(a, b : p^{r-1}c, d) =
\xi(c)f$ for $(a, b : p^{r-1}c, d) \in \Gamma_{01}(p^{r-1}, N)\}$ and $\mathcal{M}(\Gamma_{01}(p^r, N), \chi, k) =
\{f \in \mathcal{M}(\Gamma_{01}(p^r, N), k)$ s.t. $f|_k(a, b : c, d) = \chi(d)f$ for $(a, b : c, d) \in \Gamma_0(p^r, N)\}$.
Analogous decompositions of course hold for $\mathcal{M}(\Gamma_1(p^r, N), k)$.

The goal of this section is to prove that a modular form of level $N$ on
$\Gamma_0(p^r)$ is a p-adic modular form of the same level and weight, i.e.,
$\mathcal{M}(\Gamma_0(p^r, N), k) \subset \mathcal{M}(\Gamma(N), (k, k))$. A reasonable strategy for doing this
would be first to prove it directly for forms on $\Gamma_0(p, N)$ and certain Eisenstein
series on $\Gamma_0(p^r, N)$ having characters with respect to $\Gamma_0(p^{r-1}, N)$. The
Eisenstein series could then be used to inductively reduce the theorem from
$\Gamma_0(p^r, N)$ to $\Gamma_0(p, N)$. However, since $\Gamma_0(p^r, N)$ is not a normal subgroup of
$\Gamma_0(p^{r-1}, N)$, the desired characters do not exist. So instead we apply the
strategy to show inductively that forms on $\Gamma_{01}(p^r, N)$, which is a normal
subgroup of $\Gamma_{01}(p^{r-1}, N)$, are p-adic modular quasiforms of level $N$. The
decomposition of $\mathcal{M}(\Gamma_{01}(p^r, N), k)$ with respect to characters from $\Gamma_0(p^r, N)$
may then be employed to prove the desired result.

Proposition 7 Any modular form on $\Gamma_0(p, N)$ of weight $k$ is a p-adic
modular form of level $N$ and weight $(k, k)$, i.e., $\mathcal{M}_K(\Gamma_0(p, N), k) \subset
\mathcal{M}_K(\Gamma(N), (k, k))$.
Proof Choose integers $\alpha$ and $\gamma$ such that $\alpha p - \gamma N^2 = 1$, let $m = pN$, and
write $W = (p\alpha, N : m\gamma, p)$ so $\det W = p$. Set $A = (p, 0 : 0, 1)$,
$S_j = (1, Nj : 0, 1)$, $T = (p\alpha, N : N\gamma, 1)$,

$\Gamma^0(p, N) = \{(a, b : c, d) \in \Gamma(N) \mid b \equiv 0 \bmod p\}$.

Write $f|M$ for $f|_{\mathrm{wt} f} M$ and $f|U^*$ for $\sum_{j=1}^{p} f|A^{-1}|S_j$, so $f|U^* = p^{1 - k/2} f|U$.
We claim that if $f \in \mathcal{M}(\Gamma_0(p, N), k)$, then $(f|W + f|U^*) \in \mathcal{M}(\Gamma(N), k)$.

First, $f|A^{-1} \in \mathcal{M}(\Gamma^0(p, N), k)$ because

$(1/p, 0 : 0, 1)(x, py : z, w)(p, 0 : 0, 1) = (x, y : pz, w)$.

Second, the $p + 1$ matrices $T$, $S_j$ $(1 \leq j \leq p)$ form a complete set of right
coset representatives of $\Gamma^0(p, N)$ in $\Gamma(N)$. Indeed, a trivial calculation shows
that they lie in different cosets, while the coset space $\Gamma^0(p, N) \backslash \Gamma(N)$ injects into
$\Gamma^0(p) \backslash \Gamma$, which has $p + 1$ elements. Putting these facts together,

$$f|W + f|U^* = (f|A^{-1})|T + \sum_{j=1}^{p} (f|A^{-1})|S_j \in \mathcal{M}(\Gamma(N), k),$$

as desired. Moreover, another simple calculation shows that $W$ normalizes
$\Gamma_0(p, N)$, so that if $f \in \mathcal{M}(\Gamma_0(p, N), k)$, then $f|W^{-1} \in \mathcal{M}(\Gamma_0(p, N), k)$ and
$(f + f|W^{-1}|U^*) \in \mathcal{M}(\Gamma(N), k)$.

For a fixed even integer $l \geq 4$ with $(p - 1) \mid l$, let $E = E_l$ and $g = E -
p^{l/2} E|W \in \mathcal{M}(\Gamma_0(p, N), l)$. Then $g \equiv 1 \bmod p$ and $g|W^{-1} \equiv 0 \bmod p^{1 + l/2}$.
In fact,

$E|W = E|(p\alpha, N : m\gamma, p) = E|(\alpha, N : N\gamma, p)(p, 0 : 0, 1)$

$= E|(p, 0 : 0, 1) = p^{l/2} E|V$

since $(\alpha, N : N\gamma, p) \in \Gamma$, and similarly $E|W^{-1} = p^{l/2} E|V$. Because
$E \equiv 1 \bmod p$, $g = E - p^{l} E|V \equiv 1 \bmod p$, and

$g|W^{-1} = (E - p^{l/2} E|W)|W^{-1} = E|W^{-1} - p^{l/2} E$

$= p^{l/2}(E|V - E) \equiv 0 \bmod p^{1 + l/2}$.

Finally, if $f \in \mathcal{M}(\Gamma_0(p, N), k)$, define $f_m = f g^{p^m} + f g^{p^m}|W^{-1}|U^*$ for each
$m \in \mathbf{N}$. Since $f g^{p^m}$ is on $\Gamma_0(p, N)$, $f_m$ is on $\Gamma(N)$ by the remarks above. To
prove the theorem, one needs $f_m \to f$ as $m \to \infty$. Write $k_m = k + l p^m$ so
$f_m = f g^{p^m} + p^{1 - k_m/2}(f g^{p^m})|W^{-1}|U$. Now $f g^{p^m} \to f$ because $g \equiv 1 \bmod p$, and it
remains to show $p^{1 - k_m/2} f g^{p^m}|W^{-1}|U \to 0$. Indeed,

$v_p(p^{1 - k_m/2} f g^{p^m}|W^{-1}|U) \geq v_p(p^{1 - k_m/2} f g^{p^m}|W^{-1})$

$= v_p(f|W^{-1}) + p^m v_p(g|W^{-1}) + 1 - k_m/2$

$\geq v_p(f|W^{-1}) + 1 - k/2 + p^m \to \infty$.
From now on, let $\omega$ denote the unique character of $(\mathbf{Z}/p\mathbf{Z})^\times$ such that
$v_p(\omega(x) - x) > 0$ for all $x \in \mathbf{Z}$. Clearly $\omega$ generates the character group
$(\mathbf{Z}/p\mathbf{Z})^{\times *}$, that is, $\omega^1, \omega^2, \ldots, \omega^{p-1}$ is a complete list of characters of
$(\mathbf{Z}/p\mathbf{Z})^\times$. We always consider each $\omega^\gamma$ to be a Dirichlet character of modulus
$p$, so in particular $\omega^{p-1}(n) = 0$ if $p \mid n$, and the L-series $L(s, \omega^{p-1}) =
(1 - p^{-s})\zeta(s)$. In the future, when discussing modular forms on $\Gamma$ or $\Gamma_{01}(p^r)$,
$q$ will denote $e^{2\pi i z}$. But when these forms are used in conjunction with forms
for $\Gamma(N)$, they should be viewed as series in $q^N$ with $q = e^{2\pi i z/N}$.
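The character $\omega$ described here is the Teichmüller character, and its values can be approximated to any finite p-adic precision. The following sketch computes $\omega(x)$ modulo $p^n$ as $x^{p^{n-1}} \bmod p^n$; the function name `teichmuller` and the choice of working modulo $p^n$ are assumptions made for this illustration.

```python
def teichmuller(x, p, n):
    """Return the Teichmuller representative of x modulo p**n.

    For x prime to p this is the unique (p-1)-th root of unity mod p**n
    congruent to x mod p; for x divisible by p the Dirichlet character
    convention omega(x) = 0 is used.
    """
    if x % p == 0:
        return 0
    return pow(x, p ** (n - 1), p ** n)


def has_exact_order(value, order, modulus):
    """Check that value has multiplicative order exactly `order` mod modulus."""
    if pow(value, order, modulus) != 1:
        return False
    return all(pow(value, e, modulus) != 1 for e in range(1, order))
```

For example, with $p = 7$ and precision $n = 4$, `teichmuller(3, 7, 4)` is congruent to 3 modulo 7 and has order exactly 6 modulo $7^4$, reflecting the fact that $\omega$ generates the character group.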

Proposition 8 Any modular form on $\Gamma_{01}(p, N)$ of character $\omega^\gamma$ and weight
$k$ is a p-adic modular form of level $N$ and weight $(k, k + \gamma)$, i.e.,

$$\mathcal{M}_K(\Gamma_{01}(p, N), \omega^\gamma, k) \subset \mathcal{M}_K(\Gamma(N), (k, k + \gamma)).$$

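Proof First note that the series

$$G_k(\omega^\gamma) = G_k(q, \omega^\gamma) = L(1 - k, \omega^\gamma) + \sum_{n=1}^{\infty} \sum_{v \mid n} (\mathrm{sgn}\, v)\,\omega^\gamma(v)\,v^{k-1} q^n$$

lies in $\mathcal{M}(\Gamma_{01}(p), \omega^\gamma, k)$, as shown by its expression as a sum of Eisenstein
series of level $p$. It follows directly from Theorem 3 of [1] (see Lemma 13 of
this paper) that if $(p - 1) \mid (k + \gamma)$ and $p \neq 2$, then $v_p(L(1 - k, \omega^\gamma)) =
v_p(B_{k,\omega^\gamma}/k) = -1 - v_p(k)$. Hence, in that case

$$E_k(\omega^\gamma) = L(1 - k, \omega^\gamma)^{-1} G_k(\omega^\gamma) \equiv 1 \bmod p^{v_p(k) + 1}.$$

If $p = 2$, $\mathcal{M}_K(\Gamma_{01}(p, N), \omega^\gamma, k) = \mathcal{M}_K(\Gamma_0(p, N), k) \subset \mathcal{M}_K(\Gamma(N), (k, k)) =
\mathcal{M}_K(\Gamma(N), (k, k + \gamma))$ by Proposition 7. If $p \neq 2$, suppose
$f \in \mathcal{M}_K(\Gamma_{01}(p, N), \omega^\gamma, k)$ and let $k_m = p^m(p - 1 + \gamma)$, so that $k_m \to 0$ in $\mathbf{Z}_p$ and $k_m \equiv \gamma \bmod (p - 1)$. Then

$$f = \lim_{m \to \infty} f E_{k_m}(\omega^{-\gamma})$$

while

$$f E_{k_m}(\omega^{-\gamma}) \in \mathcal{M}_K(\Gamma_{01}(p, N), \omega^0, k + k_m) = \mathcal{M}_K(\Gamma_0(p, N), k + k_m).$$

Thus, $f$ is a p-adic modular form of level $N$ and weight $(k, k + \gamma) =
\lim_{m \to \infty}(k + k_m)$ by Proposition 7. That $f$ is in fact defined over $K$ as a p-adic
modular form of level $N$ follows from Corollary 2.1.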

We now commence the second part of our strategy by considering certain
Eisenstein series. In all that follows, summations such as $\sum_{v \mid n}$ and $\sum_{n \equiv 1\,(p)}$
will be over both negative and positive integers, excluding 0.

Proposition 9 For any integer $k \geq 2$ and any character $\xi$ of $\mathbf{Z}/p\mathbf{Z}$,

$$H_k(\xi) = \alpha_k(p) + \sum_{n=1}^{\infty} \sum_{v \mid n} \xi(-n/v)(\mathrm{sgn}\, v)\,v^{k-1} \zeta_p^{v} q^n \in \mathcal{M}(\Gamma_{01}(p^2), \xi, k),$$

where $\alpha_k(p) = \lambda_k(p) \sum_{n \equiv 1\,(p)} n^{-k}$ and $\lambda_k(p) = p^k (k - 1)!/(-2\pi i)^k$. Also
$\alpha_k(p) \in \mathbf{Q}(\zeta_p)$ and if

$(p - 1) \mid k$ when $p \neq 2$, or $2 \mid k$ when $p = 2$,

then $v_p(\alpha_k(p)) = -1 - v_p(k)$.

Proof The Eisenstein series of level $p$,

$$G_k(\tau, c, d, p) = \alpha_k(c, d, p) + \sum_{n=1}^{\infty} \sum_{\substack{v \mid n \\ n/v \equiv c\,(p)}} (\mathrm{sgn}\, v)\,v^{k-1} \zeta_p^{dv} \tau^n,$$

are modular forms on $\Gamma(p)$ [8]. Here $\tau = e^{2\pi i z/p}$, $\zeta_p = e^{2\pi i/p}$, $\alpha_k(c, d, p) = 0$ if
$c \not\equiv 0 \bmod p$, and $\alpha_k(c, d, p) = \lambda_k(p) \sum_{n \equiv d\,(p)} n^{-k}$ if $c \equiv 0 \bmod p$. Furthermore,
$G_k(\tau, c, d, p)|L = G_k(\tau, (c, d)L, p)$ for $L \in \Gamma$, and $G_k(q, c, d, p) =
p^{-k/2} G_k(\tau, c, d, p)|(p, 0 : 0, 1)$. Hence, $G_k(q, c, d, p)|(x, y : pz, w) =
G_k(q, cx + dz, dw, p)$ if $(x, y : pz, w) \in \Gamma_0(p)$. In particular, if $d = 1$ and
$(x, y : pz, w) \in \Gamma_{01}(p)$, then $G_k(q, c, 1, p)|(x, y : pz, w) = G_k(q, c + z, 1, p)$.
Therefore

$$\sum_{c=1}^{p} \xi(-c)\,G_k(q, c, 1, p)|(x, y : pz, w) = \xi(z) \sum_{c=1}^{p} \xi(-c)\,G_k(q, c, 1, p),$$

i.e., $\sum_{c=1}^{p} \xi(-c)G_k(q, c, 1, p) \in \mathcal{M}(\Gamma_{01}(p^2), \xi, k)$. The first assertion of the
theorem now follows upon observing that in fact

$$H_k(\xi) = H_k(q, \xi) = \sum_{c=1}^{p} \xi(-c)\,G_k(q, c, 1, p).$$

To show $\alpha_k(p) \in \mathbf{Q}(\zeta_p)$, let $\sigma$ be an automorphism of $\mathbf{C}$ over $\mathbf{Q}(\zeta_p)$. Since
$H_k(\xi) \in \mathcal{M}(\Gamma_1(p^2), k)$ and $\mathcal{M}(\Gamma_1(p^2), k)$ has a basis over $\mathbf{Q}$,
$H_k(\xi)^\sigma \in \mathcal{M}(\Gamma_1(p^2), k)$. Thus $H_k(\xi) - H_k(\xi)^\sigma$ is a modular form on $\Gamma_1(p^2)$ of
weight $k$. But since all coefficients of $H_k(\xi)$ except possibly the first lie in
$\mathbf{Q}(\zeta_p)$, $H_k(\xi) - H_k(\xi)^\sigma = \alpha_k(p) - \alpha_k(p)^\sigma$, a modular form on $\Gamma_1(p^2)$ of weight
0. This is impossible unless $\alpha_k(p)^\sigma = \alpha_k(p)$.

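It remains to evaluate $v_p(\lambda_k(p) \sum_{n \equiv 1\,(p)} n^{-k})$ when

$(p - 1) \mid k$ when $p \neq 2$, or $2 \mid k$ when $p = 2$.

If $p = 2$,

$$\sum_{n \equiv 1\,(p)} n^{-k} = 2(1 - 2^{-k})\zeta(k)$$

and

$$v_p\Bigl(\lambda_k(p) \sum_{n \equiv 1\,(p)} n^{-k}\Bigr) = v_p((2^k - 1)B_k/k) = -1 - v_p(k).$$

If $p \neq 2$,

$$\sum_{n \equiv 1\,(p)} n^{-k} = \sum_{\substack{n = 1 \\ n \equiv \pm 1\,(p)}}^{\infty} n^{-k} = (2/(p - 1)) \sum_{\substack{1 \leq j \leq p-1 \\ j\ \mathrm{even}}} \sum_{n=1}^{\infty} \omega^j(n) n^{-k} = (2/(p - 1)) \sum_{\substack{1 \leq j \leq p-1 \\ j\ \mathrm{even}}} L(k, \omega^j)$$

since $\omega^2, \omega^4, \ldots, \omega^{p-1}$ is a complete list of characters of $(\mathbf{Z}/p\mathbf{Z})^\times/\{\pm 1\}$ and $k$
is even. Now, on the one hand $L(k, \omega^{p-1}) = (1 - p^{-k})\zeta(k)$ so
$v_p(\lambda_k(p)L(k, \omega^{p-1})) = v_p((p^k - 1)B_k/2k) = -1 - v_p(k)$. On the other hand, if
$j \neq p - 1$, there is a functional equation $\lambda_k(p)L(k, \omega^j) = \tau(\omega^j)L(1 - k, \bar{\omega}^j)$
with $\tau(\omega^j) = \sum_{x=1}^{p-1} \omega^j(x)\zeta_p^x$ a Gauss sum. Since

$$G_k(\omega^j) = L(1 - k, \omega^j) + \sum_{n=1}^{\infty} \sum_{v \mid n} \omega^j(v)(\mathrm{sgn}\, v)\,v^{k-1} q^n \in \mathcal{M}(\Gamma_{01}(p), \omega^j, k) \subset \mathcal{M}(\Gamma, (k, k + j))$$

by Proposition 8, Corollary C gives $v_p(L(1 - k, \omega^j)) \geq 0$. Putting these facts
together,

$$v_p\Bigl(\lambda_k(p) \sum_{n \equiv 1\,(p)} n^{-k}\Bigr) = v_p\Bigl(\tfrac{2}{p-1}\Bigl[\lambda_k(p)L(k, \omega^{p-1}) + \sum_{\substack{1 \leq j < p-1 \\ j\ \mathrm{even}}} \lambda_k(p)L(k, \omega^j)\Bigr]\Bigr) = -1 - v_p(k).$$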
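The valuation $-1 - v_p(k)$ that appears here ultimately rests on the Clausen-von Staudt theorem for Bernoulli numbers: if $k$ is even and $(p - 1) \mid k$, then $v_p(B_k/k) = -1 - v_p(k)$. The sketch below verifies this for small cases using exact rational arithmetic; the helper names are hypothetical and the range of $k$ is an arbitrary choice for the illustration.

```python
from fractions import Fraction
from math import comb


def bernoulli_numbers(n_max):
    """Return [B_0, ..., B_{n_max}] as exact fractions (with B_1 = -1/2)."""
    B = [Fraction(0)] * (n_max + 1)
    B[0] = Fraction(1)
    for m in range(1, n_max + 1):
        B[m] = -sum(comb(m + 1, j) * B[j] for j in range(m)) / (m + 1)
    return B


def vp(x, p):
    """p-adic valuation of a nonzero rational number."""
    x = Fraction(x)
    if x == 0:
        raise ValueError("valuation of 0 is infinite")
    num, den, v = x.numerator, x.denominator, 0
    while num % p == 0:
        num //= p
        v += 1
    while den % p == 0:
        den //= p
        v -= 1
    return v


def check_valuation(p, k_max):
    """Check v_p(B_k / k) == -1 - v_p(k) for even k <= k_max with (p-1) | k."""
    B = bernoulli_numbers(k_max)
    return all(
        vp(B[k] / k, p) == -1 - vp(k, p)
        for k in range(2, k_max + 1, 2)
        if k % (p - 1) == 0
    )
```

Calling `check_valuation(p, 40)` for $p = 2, 3, 5, 7$ exercises both cases of the hypothesis, including $p = 2$ where the condition reduces to $k$ even.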


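We shall now show that $H_k(\xi) \in \mathcal{M}(\Gamma_{01}(p^2), k)$ is a p-adic modular quasiform
in a way that is consistent with the decomposition

$$\mathcal{M}(\Gamma_{01}(p^2), k) = \bigoplus_{\chi} \mathcal{M}(\Gamma_{01}(p^2), \chi, k).$$

Proposition 10 Let $k \geq 2$, $\xi \in (\mathbf{Z}/p\mathbf{Z})^*$ and $H_k(\xi) = \sum_{\gamma=1}^{p-1} H_k^\gamma(\xi)$ with
$H_k^\gamma(\xi) \in \mathcal{M}(\Gamma_{01}(p^2), \omega^\gamma, k)$. Then $H_k^\gamma(\xi) \in \mathcal{M}(\Gamma, (k, k + \gamma))$.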

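Proof For any $f \in \mathcal{M}(\Gamma_{01}(p^2), k)$, the $\omega^\gamma$-component of $f$ is

$$f^\gamma = (1/(p - 1)) \sum_{x=1}^{p-1} \bar{\omega}^\gamma(x)\, f|R_x, \quad\text{where } R_x \in \Gamma,\ R_x \equiv (x^{-1}, 0 : 0, x) \bmod p^2.$$

In particular,

$$(p - 1)H_k^\gamma(\xi) = \sum_{x=1}^{p-1} \bar{\omega}^\gamma(x) \sum_{c=1}^{p} \xi(-c)\,G_k(q, c, 1, p)|R_x,$$

or, after writing $\xi(1) = \zeta_p^m$ and performing a short calculation:

$$(p - 1)H_k^\gamma(\xi) = \lambda_k(p) \sum_{n} \bar{\omega}^\gamma(n) n^{-k} + \sum_{n=1}^{\infty} \sum_{v \mid n} \sum_{x=1}^{p-1} \bar{\omega}^\gamma(x)\,\zeta_p^{(v - mn/v)x} (\mathrm{sgn}\, v)\,v^{k-1} q^n. \qquad (3)$$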

This series will be shown to be a p-adic modular form of weight $(k, k + \gamma)$ by
expressing it as a sum of the known p-adic modular forms discussed below.
We have seen that

$$G_k(\omega^i) = L(1 - k, \omega^i) + \sum_{n=1}^{\infty} \sum_{v \mid n} \omega^i(v)(\mathrm{sgn}\, v)\,v^{k-1} q^n$$

is a p-adic modular form of weight $(k, k + i)$ by Proposition 8. Note that if $k$
and $i$ do not have the same parity, $G_k(\omega^i)$ is identically zero. If $R_h$ is Serre’s
differential operator,

$$G_k(\omega^i, \omega^j) = G_k(q, \omega^i, \omega^j) = G_k(\omega^i)|R_{(0,j)} = \sum_{n=1}^{\infty} \sum_{v \mid n} \omega^j(n)\,\omega^i(v)(\mathrm{sgn}\, v)\,v^{k-1} q^n$$

is a p-adic modular form of weight $(k, k + i + 2j)$. Finally, if
$1 \leq c, d \leq p - 1$,

$$G_k(c, d) = G_k(q, c, d) = \sum_{\substack{n = 1 \\ n \equiv d\,(p)}}^{\infty} \sum_{\substack{v \mid n \\ v \equiv c\,(p)}} (\mathrm{sgn}\, v)\,v^{k-1} q^n = (p - 1)^{-2} \sum_{i=1}^{p-1} \sum_{j=1}^{p-1} \bar{\omega}^i(c)\,\bar{\omega}^j(d)\,G_k(q, \omega^i, \omega^j)$$

is a p-adic modular quasiform of weight $k$.

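First, suppose that $\gamma \neq p - 1$. Then $\sum_{x=1}^{p-1} \bar{\omega}^\gamma(x)\,\zeta_p^{(v - mn/v)x} = 0$ if
$v - mn/v \equiv 0 \bmod p$, and

$$\sum_{x=1}^{p-1} \bar{\omega}^\gamma(x)\,\zeta_p^{(v - mn/v)x} = \omega^\gamma(v - mn/v)\,\tau(\bar{\omega}^\gamma)$$

if $v - mn/v \not\equiv 0$. Applying this fact and the functional equation of $L(k, \bar{\omega}^\gamma)$ to
(3) gives

$$(p - 1)\,\tau(\bar{\omega}^\gamma)^{-1} H_k^\gamma(\xi) = L(1 - k, \omega^\gamma) + \sum_{n=1}^{\infty} \sum_{v \mid n} \omega^\gamma(v - mn/v)(\mathrm{sgn}\, v)\,v^{k-1} q^n.$$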
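The character-sum evaluation used in this step is the standard identity $\sum_{x=1}^{p-1} \bar{\chi}(x)\zeta_p^{ax} = \chi(a)\tau(\bar{\chi})$ for $a \not\equiv 0 \bmod p$, with the sum vanishing for $a \equiv 0$ when $\chi$ is nontrivial. A floating-point sanity check can be sketched as follows. Note the assumption: the character here is realized with complex values through a discrete logarithm, as a stand-in for the p-adic valued $\omega^\gamma$ of the text, and all function names are hypothetical.

```python
import cmath


def primitive_root(p):
    """Smallest primitive root modulo an odd prime p (brute force)."""
    for g in range(2, p):
        if len({pow(g, e, p) for e in range(1, p)}) == p - 1:
            return g
    raise ValueError("no primitive root found")


def make_character(p, gamma):
    """Complex-valued character chi with chi(g) = exp(2*pi*i*gamma/(p-1))."""
    g = primitive_root(p)
    log = {pow(g, e, p): e for e in range(p - 1)}

    def chi(x):
        x %= p
        if x == 0:
            return 0j
        return cmath.exp(2j * cmath.pi * gamma * log[x] / (p - 1))

    return chi


def twisted_sum(p, gamma, a):
    """Compute sum_{x=1}^{p-1} conj(chi(x)) * zeta_p^(a*x)."""
    chi = make_character(p, gamma)
    zeta = cmath.exp(2j * cmath.pi / p)
    return sum(chi(x).conjugate() * zeta ** (a * x) for x in range(1, p))


def identity_holds(p, gamma, a, tol=1e-9):
    """Check the identity for one value of a (gamma not divisible by p-1)."""
    chi = make_character(p, gamma)
    tau_conj = twisted_sum(p, gamma, 1)  # Gauss sum of the conjugate character
    expected = chi(a) * tau_conj if a % p else 0j
    return abs(twisted_sum(p, gamma, a) - expected) < tol
```

For instance, `identity_holds(7, 2, a)` can be evaluated for every residue `a` modulo 7.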

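This expression can be written as the sum of three series, each of which is a
p-adic modular form of weight $(k, k + \gamma)$. Indeed, it equals

$$\Bigl[L(1 - k, \omega^\gamma) + \sum_{\substack{n = 1 \\ p \mid n}}^{\infty} \sum_{\substack{v \mid n \\ p \nmid v}} T(v, n) q^n\Bigr] + \sum_{\substack{n = 1 \\ p \nmid n}}^{\infty} \sum_{v \mid n} T(v, n) q^n + \sum_{\substack{n = 1 \\ p \mid n}}^{\infty} \sum_{\substack{v \mid n \\ p \mid v}} T(v, n) q^n$$

where $T(v, n) = \omega^\gamma(v - mn/v)(\mathrm{sgn}\, v)\,v^{k-1}$. The first term is

$$L(1 - k, \omega^\gamma) + \sum_{p \mid n} \sum_{v \mid n} \omega^\gamma(v)(\mathrm{sgn}\, v)\,v^{k-1} q^n = G_k(\omega^\gamma)|U|V,$$

a p-adic modular form of weight $(k, k + \gamma)$.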

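The second term is

$$\sum_{c, d = 1}^{p-1} \omega^\gamma(c - md/c)\,G_k(c, d) = (p - 1)^{-2} \sum_{c, d = 1}^{p-1} \omega^\gamma(c - md/c) \sum_{i, j = 1}^{p-1} \bar{\omega}^i(c)\,\bar{\omega}^j(d)\,G_k(\omega^i, \omega^j).$$

The coefficient of a particular $G_k(\omega^i, \omega^j)$ in this expression is $(p - 1)^{-2}$ times

$$\sum_{c=1}^{p-1} \sum_{d=1}^{p-1} \omega^\gamma(c - md/c)\,\bar{\omega}^i(c)\,\bar{\omega}^j(d) = \sum_{c=1}^{p-1} \sum_{d=1}^{p-1} \omega^\gamma(c - mc^2 d/c)\,\bar{\omega}^i(c)\,\bar{\omega}^j(c^2 d)$$

$$= \sum_{c=1}^{p-1} \omega^{\gamma - i - 2j}(c) \sum_{d=1}^{p-1} \omega^\gamma(1 - md)\,\bar{\omega}^j(d),$$

and this is zero unless $i + 2j \equiv \gamma \bmod (p - 1)$. Hence, the second term is a
linear combination of those $G_k(\omega^i, \omega^j)$ with weight $(k, k + \gamma)$.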

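The third term is

$$\omega^\gamma(-m) \sum_{\substack{n = 1 \\ p \mid n}}^{\infty} \sum_{\substack{v \mid n \\ p \mid v}} \omega^\gamma(n/v)(\mathrm{sgn}\, v)\,v^{k-1} q^n = \omega^\gamma(-m) \sum_{t=1}^{\infty} \sum_{\substack{n = 1 \\ p \mid n}}^{\infty} \sum_{\substack{v \mid n \\ v_p(v) = t}} \omega^\gamma(n/v)(\mathrm{sgn}\, v)\,v^{k-1} q^n$$

$$= \omega^\gamma(-m) \sum_{t=1}^{\infty} p^{t(k-1)} \Bigl(\sum_{n=1}^{\infty} \sum_{v \mid n} \omega^\gamma(n)\,\bar{\omega}^\gamma(v)(\mathrm{sgn}\, v)\,v^{k-1} q^n\Bigr)\Big|V^t = \omega^\gamma(-m) \sum_{t=1}^{\infty} p^{t(k-1)}\,G_k(\bar{\omega}^\gamma, \omega^\gamma)|V^t.$$

Since $G_k(\bar{\omega}^\gamma, \omega^\gamma)$ is a p-adic modular form of weight $(k, k - \gamma + 2\gamma) =
(k, k + \gamma)$, so is the third term.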

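Now suppose $\gamma = p - 1$. Then $\sum_{x=1}^{p-1} \bar{\omega}^\gamma(x)\,\zeta_p^{(v - mn/v)x} = p - 1$ or $-1$ as
$(v - mn/v) \equiv 0$ or $\not\equiv 0 \bmod p$. It follows easily that

$$(p - 1)H_k^\gamma(\xi) = -\Bigl[\lambda_k(p)p^{-k} \sum_{n} n^{-k} + \sum_{n=1}^{\infty} \sum_{v \mid n} (\mathrm{sgn}\, v)\,v^{k-1} q^n\Bigr] + p\Bigl[\sum_{\substack{n = 1 \\ p \nmid n}}^{\infty} \sum_{\substack{v \mid n \\ v - mn/v \equiv 0}} (\mathrm{sgn}\, v)\,v^{k-1} q^n\Bigr]$$

$$+\ p\Bigl[\lambda_k(p)p^{-1} \sum_{n} n^{-k} + \sum_{n=1}^{\infty} \sum_{\substack{v \mid n,\ p \mid v \\ v - mn/v \equiv 0}} (\mathrm{sgn}\, v)\,v^{k-1} q^n\Bigr].$$

The first term is zero if $k$ is odd and an ordinary Eisenstein series if $k$ is
even, and so is certainly a p-adic modular form of weight $(k, k)$. The second term is

$$p \sum_{\substack{c, d = 1 \\ c - md/c \equiv 0}}^{p-1} G_k(c, d) = p(p - 1)^{-2} \sum_{\substack{c, d = 1 \\ c - md/c \equiv 0}}^{p-1} \sum_{i, j = 1}^{p-1} \bar{\omega}^i(c)\,\bar{\omega}^j(d)\,G_k(\omega^i, \omega^j),$$

in which sum the only $G_k(\omega^i, \omega^j)$ having nonzero coefficients are such that
$i + 2j \equiv 0 \equiv \gamma \bmod (p - 1)$. Finally, the third term equals

$$p \sum_{t=1}^{\infty} p^{t(k-1)} \Bigl[\lambda_k(p)p^{-k}(1 - p^{k-1}) \sum_{n} n^{-k} + \sum_{n=1}^{\infty} \sum_{\substack{v \mid n \\ p \nmid v}} (\mathrm{sgn}\, v)\,v^{k-1} q^n\Bigr]\Big|V^t$$

$$= p \sum_{t=1}^{\infty} p^{t(k-1)}\,G_k(\omega^{p-1})|V^t \in \mathcal{M}(\Gamma, k) = \mathcal{M}(\Gamma, (k, k + \gamma)),$$

completing the proof.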

For any character $\xi$ and even integer $k \geq 2$, let $E_k(\xi) = \alpha_k(p)^{-1} H_k(\xi)$.
Then $E_{-k}(\xi) = 1/E_k(\xi)$ is a meromorphic modular form on $\Gamma_{01}(p^2)$ of weight
$-k$. Moreover, $E_{-k}(\xi) = \sum_{\gamma=1}^{p-1} E_{-k}^\gamma(\xi)$ with

$$E_{-k}^\gamma(\xi) = (p - 1)^{-1} \sum_{x=1}^{p-1} \bar{\omega}^\gamma(x)\,E_{-k}(\xi)|R_x$$

a meromorphic modular form on $\Gamma_{01}(p^2)$ of character $\omega^\gamma$ and weight $-k$.
Each $E_{-k}^\gamma(\xi)$ has a finite-tailed Laurent expansion in $q$, valid in some
neighborhood of $i\infty$.


Proposition 11 Let $\xi$ be a character of $\mathbf{Z}/p\mathbf{Z}$ and let $k \geq 2$ be an integer
such that

$(p - 1) \mid k$ when $p \neq 2$, or $2 \mid k$ when $p = 2$.

Then the q-expansion of each $E_{-k}^\gamma(\xi)$ lies in $\mathbf{Q}[[q]]$ and is a p-adic modular form
of weight $(-k, -k + \gamma)$.

Proof Write $E_k$ for $E_k(\xi)$, so $E_k = 1 + p \sum_{n=1}^{\infty} a_n q^n$ with $a_n \in \mathbf{Q}(\zeta_p)$ and
$v_p(a_n) \geq 0$, by Proposition 9. Therefore, $E_k^{p^m - 1}$ approaches a limit in $\mathbf{Q}_p[[q]]$
as $m \to \infty$, and that limit is $E_{-k} = 1/E_k$. But by Proposition 10, $E_k$ and thus
$E_k^{p^m - 1}$ are p-adic modular quasiforms. Corollary 2 of Theorem 6 hence
implies that $E_{-k}$ is a p-adic modular quasiform of weight $-k$, i.e.,
$E_{-k} = \sum_\gamma E_{-k}'^{\gamma}$ with $E_{-k}'^{\gamma} \in \mathcal{M}(\Gamma, (-k, -k + \gamma))$. On the other hand, we
already know that $E_{-k} = \sum_\gamma E_{-k}^{\gamma}$ with $E_{-k}^{\gamma}$ a Laurent series representing a
meromorphic modular form on $\Gamma_{01}(p^2)$ of character $\omega^\gamma$ and weight $-k$.

By Proposition 10, $E_k$ itself equals $\sum_\gamma E_k^\gamma$ with $E_k^\gamma \in \mathbf{Q}[[q]]$ both a modular
form on $\Gamma_{01}(p^2)$ of character $\omega^\gamma$ and weight $k$, and a p-adic modular form of
weight $(k, k + \gamma)$. The fact that $(\sum_{\gamma_1} E_k^{\gamma_1})(\sum_{\gamma_2} E_{-k}'^{\gamma_2}) = 1$ is equivalent, by
the corollary to Proposition 4, to a system of $p - 1$ linear equations in the
$p - 1$ “variables” $E_{-k}'^{\gamma}$, namely

$$\sum_{\gamma_1 + \gamma_2 \equiv \beta\,(p-1)} E_k^{\gamma_1} E_{-k}'^{\gamma_2} = \delta_{\beta, p-1} \quad (1 \leq \beta \leq p - 1).$$

And the fact that $(\sum_{\gamma_1} E_k^{\gamma_1})(\sum_{\gamma_2} E_{-k}^{\gamma_2}) = 1$ is equivalent, by linear
independence of forms with different characters, to the same system of linear
equations in the variables $E_{-k}^{\gamma}$, namely

$$\sum_{\gamma_1 + \gamma_2 \equiv \beta\,(p-1)} E_k^{\gamma_1} E_{-k}^{\gamma_2} = \delta_{\beta, p-1} \quad (1 \leq \beta \leq p - 1).$$

Since the theorem will be proved if $E_{-k}'^{\gamma} = E_{-k}^{\gamma}$, it suffices to show that
the constant term of the series represented by the determinant of the system
of linear equations is not zero. The determinant is clearly a cyclic determinant
with first row $(\beta = 1)$ $(E_k^{p-1}, E_k^{p-2}, \ldots, E_k^{1})$. By Eq. (3), the constant term
of $E_k^\gamma$ is $\lambda_k(p) \sum_n \bar{\omega}^\gamma(n) n^{-k}/((p - 1)\alpha_k(p))$. Hence, up to a nonzero factor, the
constant term of the determinant is given by a cyclic determinant with first
row $(\sum_n \bar{\omega}^{p-1}(n) n^{-k}, \ldots, \sum_n \bar{\omega}^{1}(n) n^{-k})$. The value of a cyclic determinant
with first row $(a_1, a_2, \ldots, a_n)$ is $\prod_{i=1}^{n} (\sum_{j=1}^{n} a_j \zeta_n^{ij})$ [7], or in the present case, up to sign,

$$\prod_{i=1}^{p-1} \Bigl(\sum_{j=1}^{p-1} \sum_{n} \omega^j(n) n^{-k} \zeta_{p-1}^{ij}\Bigr) = (p - 1)^{p-1} \prod_{x=1}^{p-1} \Bigl(\sum_{n \equiv x\,(p)} n^{-k}\Bigr) \neq 0,$$

which is nonzero because $k$ is even, so each inner sum consists of positive terms.
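The product formula for a cyclic (circulant) determinant quoted from [7] is easy to test numerically. The sketch below builds the circulant matrix whose rows are successive cyclic shifts of the first row and compares its determinant with $\prod_i \sum_j a_j \zeta_n^{ij}$. Here the indices $i, j$ run from $0$ to $n - 1$, an indexing assumption of this sketch; with the indexing $1, \ldots, n$ used in the text the product changes only by the factor $\zeta_n^{n(n+1)/2} = \pm 1$. The function names are hypothetical.

```python
import cmath


def circulant(first_row):
    """Matrix whose r-th row is the first row cyclically shifted right r times."""
    n = len(first_row)
    return [[first_row[(c - r) % n] for c in range(n)] for r in range(n)]


def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting (complex floats)."""
    m = [[complex(entry) for entry in row] for row in matrix]
    n = len(m)
    det = 1 + 0j
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            return 0j
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det
        det *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n):
                m[r][c] -= factor * m[col][c]
    return det


def circulant_product_formula(first_row):
    """Product over i of sum_j a_j * zeta_n^(i*j), with 0-based indices."""
    n = len(first_row)
    zeta = cmath.exp(2j * cmath.pi / n)
    result = 1 + 0j
    for i in range(n):
        result *= sum(a * zeta ** (i * j) for j, a in enumerate(first_row))
    return result
```

For the first row `[1, 2, 3, 4]` both `determinant(circulant(row))` and `circulant_product_formula(row)` evaluate to $-160$ up to rounding error.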

We are now ready to prove the main theorem of this section. 

Theorem 12 Any modular form on $\Gamma_{01}(p^r, N)$ of character $\omega^\gamma$ and weight
$k$ is a p-adic modular form of level $N$ and weight $(k, k + \gamma)$, that is,
$\mathcal{M}_K(\Gamma_{01}(p^r, N), \omega^\gamma, k) \subset \mathcal{M}_K(\Gamma(N), (k, k + \gamma))$.

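Proof By Corollary 2.1, it is only necessary to show
$\mathcal{M}(\Gamma_{01}(p^r, N), \omega^\gamma, k) \subset \mathcal{M}(\Gamma(N), (k, k + \gamma))$. In fact we shall prove by induction
on $r$ the equivalent statement $S(r)$: If $f \in \mathcal{M}(\Gamma_{01}(p^r, N), k)$ and
$f = \sum_{\gamma=1}^{p-1} f^\gamma$ with $f^\gamma \in \mathcal{M}(\Gamma_{01}(p^r, N), \omega^\gamma, k)$, then each

$f^\gamma \in \mathcal{M}(\Gamma(N), (k, k + \gamma))$.

For $r = 1$, this is Proposition 8.

It follows from a trivial matrix computation that if
$g(q) \in \mathcal{M}(\Gamma_{01}(p^r), \xi, k)$, then $g(q^p) = g(q)|V \in \mathcal{M}(\Gamma_{01}(p^{r+1}), \xi, k)$; and if
$g(q) \in \mathcal{M}(\Gamma_{01}(p^r), \omega^\gamma, k)$, then $g(q)|V \in \mathcal{M}(\Gamma_{01}(p^{r+1}), \omega^\gamma, k)$. Consequently,
$E_k(q^{p^r}, \xi) \in \mathcal{M}(\Gamma_{01}(p^{r+2}), \xi, k)$ and $E_k^\gamma(q^{p^r}, \xi) \in \mathcal{M}(\Gamma_{01}(p^{r+2}), \omega^\gamma, k)$. Similarly,
$E_{-k}(q^{p^r}, \xi)$ and $E_{-k}^\gamma(q^{p^r}, \xi)$ are meromorphic modular forms on $\Gamma_{01}(p^{r+2})$ of weight $-k$,
the latter of character $\omega^\gamma$. And of course $E_k^\gamma(q^{p^r}, \xi)$ and
$E_{-k}^\gamma(q^{p^r}, \xi)$ are still p-adic modular forms of respective weights $(k, k + \gamma)$ and
$(-k, -k + \gamma)$.

Another simple observation is needed. Suppose that $f_1$ is a modular form and $f_2$ a meromorphic
modular form on $\Gamma_{01}(p^{r+1}, N)$, of weights $k_1$ and $k_2$ respectively, and that

$f_3 = f_1 f_2 \in \mathcal{M}(\Gamma_{01}(p^{r+1}, N), k_3)$, where $k_3 = k_1 + k_2$.

Suppose further that $f_1 = \sum_\gamma f_1^\gamma$ with $f_1^\gamma \in \mathcal{M}(\Gamma_{01}(p^{r+1}, N), \omega^\gamma, k_1)$ a p-adic
modular form of level $N$ and weight $(k_1, k_1 + \gamma)$. Finally, suppose that if
$f_2 = \sum_\gamma f_2^\gamma$ with $f_2^\gamma$ a meromorphic modular form on $\Gamma_{01}(p^{r+1}, N)$ of character $\omega^\gamma$ and weight $k_2$, then the q-expansion at $i\infty$ of
each $f_2^\gamma$ is a p-adic modular form of level $N$ and weight
$(k_2, k_2 + \gamma)$. Then we claim that if $f_3 = \sum_\gamma f_3^\gamma$ with

$f_3^\gamma \in \mathcal{M}(\Gamma_{01}(p^{r+1}, N), \omega^\gamma, k_3)$,

each $f_3^\gamma$ is a p-adic modular form of level $N$ and weight $(k_3, k_3 + \gamma)$. Indeed,

$$f_3 = \sum_{\gamma=1}^{p-1} \sum_{\gamma_1 + \gamma_2 \equiv \gamma\,(p-1)} f_1^{\gamma_1} f_2^{\gamma_2}$$

with

$$\sum_{\gamma_1 + \gamma_2 \equiv \gamma\,(p-1)} f_1^{\gamma_1} f_2^{\gamma_2} \in \mathcal{M}(\Gamma_{01}(p^{r+1}, N), \omega^\gamma, k_3),$$

and by the uniqueness of character decomposition,

$$f_3^\gamma = \sum_{\gamma_1 + \gamma_2 \equiv \gamma\,(p-1)} f_1^{\gamma_1} f_2^{\gamma_2}.$$

From the suppositions, $f_1^{\gamma_1} f_2^{\gamma_2}$ is a p-adic modular form of level $N$ and weight
$(k_1 + k_2, k_1 + k_2 + \gamma_1 + \gamma_2)$, so $f_3^\gamma \in \mathcal{M}(\Gamma(N), (k_3, k_3 + \gamma))$.

It is now possible to assume $S(r)$ and prove $S(r + 1)$. Since

$$\mathcal{M}(\Gamma_{01}(p^{r+1}, N), k) = \bigoplus_{\xi} \mathcal{M}(\Gamma_{01}(p^{r+1}, N), \xi, k),$$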





it is enough to verify $S(r + 1)$ for $f \in \mathcal{M}(\Gamma_{01}(p^{r+1}, N), \xi, k)$. Let $t = 4(p - 1)$,
$f_1 = f E_t(q^{p^{r-1}}, \bar{\xi})$, $f_2 = E_{-t}(q^{p^{r-1}}, \bar{\xi})$. Then the hypotheses of the preceding
paragraph hold for $f_1 \in \mathcal{M}(\Gamma_{01}(p^r, N), k + t)$ by $S(r)$ while they hold for $f_2$
by Proposition 11. Hence the desired conclusion holds for $f_3 = f_1 f_2 =
f$: $f = \sum_{\gamma=1}^{p-1} f^\gamma$ with $f^\gamma \in \mathcal{M}(\Gamma_{01}(p^{r+1}, N), \omega^\gamma, k)$ a p-adic modular form of
level $N$ and weight $(k, k + \gamma)$.


4. p-adic L-functions

Using Theorem 12, we can deduce a “constant term theorem” for certain
forms on $\Gamma_1(p^r, N)$, which will in turn be used to establish the existence of
p-adic abelian L-functions. First recall that there is a natural decomposition
$(\mathbf{Z}/p^r\mathbf{Z})^\times \cong (\mathbf{Z}/p\mathbf{Z})^\times \oplus (\mathbf{Z}/p^{r-1}\mathbf{Z})$, $p \neq 2$. Correspondingly, any character $\chi$
on $(\mathbf{Z}/p^r\mathbf{Z})^\times$ may be expressed uniquely as $\chi = \chi_1 \chi_2$, where $\chi_1$ is a character
of $(\mathbf{Z}/p\mathbf{Z})^\times$ and $\chi_2$ is a character of $(\mathbf{Z}/p^{r-1}\mathbf{Z})$. Two lemmas will be needed,
the first of which is a restatement of known results on generalized Bernoulli
numbers.

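Lemma 13 Let $\chi = \chi_1 \chi_2$ be a character of $(\mathbf{Z}/p^r\mathbf{Z})^\times$, $p \neq 2$. Suppose
$k \in \mathbf{N}$, $\chi_1 = \omega^\gamma$ and $(p - 1) \mid (\gamma + k)$. Then:

(1) if $\chi_2 = 1$, $v_p(B_{k,\chi}/k) = -1 - v_p(k)$;

(2) if $\chi_2 \neq 1$, $v_p(B_{k,\chi}/k) = -1/((p - 1)p^{m-2})$, where $p^m$ is the conductor of $\chi$.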

Proof If $v$ is an integer not divisible by $p$, write $v = \omega(v)\langle v \rangle$. If
$\chi_1 = \chi_2 = 1$, the lemma is a special case of the theorem of Clausen-Von
Staudt. Otherwise, consider the ideal $\mathfrak{p} = (p, 1 - \chi(g)g^k)$ in $\mathbf{Q}(\zeta_{(p-1)p^r})$,
where $g$ is a primitive root mod $p$. Then $\mathfrak{p} = (p, 1 - \chi_1(g)\chi_2(g)\omega(g)^k \langle g \rangle^k) =
(p, 1 - \chi_2(g)\langle g \rangle^k) \neq (1)$, since $\chi_2(v)$ is a $p^r$th root of unity and
$v_p(\langle v \rangle - 1) > 0$ for all $v$. If $\chi_2 = 1$, Theorem 3 of [1] thus implies
$pB_{k,\chi} \equiv (p - 1) \bmod \mathfrak{p}$, so $v_p(pB_{k,\chi}) = 0$ and $v_p(B_{k,\chi}/k) = -1 - v_p(k)$. On the
other hand, if $\chi_2 \neq 1$, the same theorem states $(1 - \chi(1 + p))B_{k,\chi}/k \equiv
1 \bmod \mathfrak{p}$. Since $\chi$ is of conductor $p^m$, $\chi(1 + p) = \chi_2(1 + p)$ must be a primitive
$(p^{m-1})$th root of unity, so $v_p(B_{k,\chi}/k) = -v_p(1 - \chi(1 + p)) = -1/((p - 1)p^{m-2})$.

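Lemma 14 For $\chi = \omega^j \chi_2$ a character of $(\mathbf{Z}/p^r\mathbf{Z})^\times$, $p \neq 2$, let

$$\varphi_k(\chi) = \sum_{n=1}^{\infty} \sum_{v \mid n} \chi(v)\,v^{k-1} q^n.$$

If $(p - 1) \mid (k + j)$, then $\tilde{\varphi}_k(\chi) \notin \widetilde{\mathcal{M}}(\Gamma_1(p^r, N), \chi, k)$. (Here the summation is
over positive $v$.)

Proof Choose an integer $l$ large enough so that $m = (p - 1)p^l - k \geq 4$. Since
$v_p(E_m(\bar{\chi}) - 1) > 0$, $\tilde{\varphi}_k(\chi) \in \widetilde{\mathcal{M}}(\Gamma_1(p^r, N), \chi, k)$ would imply $\tilde{\varphi}_k(\chi) =
(\varphi_k(\chi)E_m(\bar{\chi}))^{\sim} \in \widetilde{\mathcal{M}}(\Gamma_0(p^r, N), (p - 1)p^l)$. Letting $s = (p - 1)p^l$, one has modulo a prime above $p$

$$\chi(v)v^{k-1} = \omega(v)^j \chi_2(v)\,\omega(v)^{k-1} \langle v \rangle^{k-1} \equiv \omega(v)^{j + k - 1} = \omega(v)^{s-1}.$$

Hence $\tilde{\varphi}_k(\chi) = \tilde{\varphi}_s(1)$, and it suffices to show

$$\tilde{\varphi}_s = \tilde{\varphi}_s(1) = \Bigl(\sum_{n=1}^{\infty} \sum_{\substack{v \mid n \\ p \nmid v}} v^{s-1} q^n\Bigr)^{\sim} \notin \widetilde{\mathcal{M}}(\Gamma_0(p^r, N), s).$$

Assume the contrary, i.e., $\tilde{\varphi}_s \in \widetilde{\mathcal{M}}(\Gamma_0(p^r, N), s)$. By Theorem 12 any modular
form on $\Gamma_0(p^r, N)$ may be approximated by forms on $\Gamma(N)$. In particular,
$\tilde{\varphi}_s \in \widetilde{\mathcal{M}}(\Gamma(N), t)$ with $s \equiv t \bmod (p - 1)$ and $t \geq 4$, which in turn implies
$\tilde{\varphi}_t \in \widetilde{\mathcal{M}}(\Gamma(N), t)$. If $\Phi_t \in \mathcal{M}(\Gamma(N), t)$ has reduction $\tilde{\varphi}_t$, and $G_t = -B_t/2t +
\sum_{n=1}^{\infty} \sigma_{t-1}(n) q^n$, then $f = G_t - \Phi_t \in \mathcal{M}(\Gamma(N), t)$. However, $v_p(a_0(f)) =
v_p(a_0(G_t)) = -1 - v_p(t)$ while $\inf_{n \geq 1} v_p(a_n(f)) > 0$. Therefore,

$$v_p(a_0(f)) < \inf_{n \geq 1} v_p(a_n(f)) - v_p(t) - 1,$$

contradicting Corollary C.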

Theorem 15A Suppose $\chi$ is a character of $(\mathbf{Z}/p^r\mathbf{Z})^\times$, $p \neq 2$, with conductor
$p^m$ and $\chi_1 = \omega^j$. Let

$$f = \sum_{n=0}^{\infty} a_n q^n \in \mathcal{M}(\Gamma_1(p^r, N), \chi, k).$$

(1) If $(p - 1) \nmid (k + j)$, then $v_p(a_0) \geq \inf_{n \geq 1} v_p(a_n)$.

(2) If $(p - 1) \mid (k + j)$ and $\chi_2 = 1$, then $v_p(a_0) \geq \inf_{n \geq 1} v_p(a_n) -
v_p(k) - 1$.

(3) If $(p - 1) \mid (k + j)$ and $\chi_2 \neq 1$, then $v_p(a_0) \geq \inf_{n \geq 1} v_p(a_n) -
1/((p - 1)p^{m-2})$.

Proof Suppose (1) is not true, so we may take $a_0 = 1$ and
$\inf_{n \geq 1} v_p(a_n) > 0$, i.e., $f = 1 + \pi \sum_{n=1}^{\infty} c_n q^n$, $\pi$ a uniformizer for a field of
definition of $f$. Then $f^{p^r} \equiv 1 \bmod \pi$, that is to say, $v_p(a_0(f^{p^r})) <
\inf_{n \geq 1} v_p(a_n(f^{p^r}))$. Since the values taken by $\chi_1$ and $\chi_2$ are respectively
$(p - 1)$th and $p^r$th roots of unity, $\chi_1^{p^r} = \chi_1$, while $\chi_2^{p^r} = 1$. Hence
$f^{p^r} \in \mathcal{M}(\Gamma_{01}(p^r, N), \omega^j, p^r k) \subset \mathcal{M}(\Gamma(N), (p^r k, k + j))$ by Theorem 12. The
fact that $v_p(a_0(f^{p^r})) < \inf_{n \geq 1} v_p(a_n(f^{p^r}))$ is therefore a contradiction to
Corollary C.

Case (2) follows directly from Theorem 12 and Corollary C. Finally,
suppose (3) is not true, so we may take $a_0 = 1$, $\inf_{n \geq 1} v_p(a_n) > 1/((p - 1)p^{m-2})$,
and $k \geq 4$, after replacing $f$ by $f \cdot E_{2(p-1)}$ if necessary. Let $g = L(1 - k, \chi)f$
so $a_0(g) = L(1 - k, \chi)$ while $\inf_{n \geq 1} v_p(a_n(g)) > 0$ by Lemma 13. Therefore,
$\tilde{\varphi}_k(\chi) = ((G_k(\chi) - g)/2)^{\sim} \in \widetilde{\mathcal{M}}(\Gamma_1(p^r, N), \chi, k)$, which contradicts Lemma 14.

There is also a decomposition of $(\mathbf{Z}/p^r\mathbf{Z})^\times$ for $p = 2$, namely $(\mathbf{Z}/2^r\mathbf{Z})^\times \cong (\mathbf{Z}/2^2\mathbf{Z})^\times \oplus (\mathbf{Z}/2^{r-2}\mathbf{Z})$. And there is a corresponding decomposition $\chi = \chi_1\chi_2$ of any character $\chi$ of $(\mathbf{Z}/2^r\mathbf{Z})^\times$. For $p = 2$, $\omega$ previously represented the trivial character of $(\mathbf{Z}/2\mathbf{Z})^\times$. Henceforward, let $\omega$ instead represent the nontrivial character of $(\mathbf{Z}/2^2\mathbf{Z})^\times$, so $\chi_1 = \omega^1$ or $\omega^2$. As an analogue of Theorem 15A, we have for $p = 2$:

Theorem 15B Suppose $\chi$ is a character of $(\mathbf{Z}/2^r\mathbf{Z})^\times$ with $\chi_1 = \omega^j$. Let

$$f = \sum_{n=0}^{\infty} a_n q^n \in \mathcal{M}(\Gamma_1(p^r, N), \chi, k).$$

(1) If $2 \nmid (k+j)$, then $v_p(a_0) \ge \inf_{n \ge 1} v_p(a_n) - v_p(k) - 2$.

(2) If $2 \mid (k+j)$ and $\chi_2 = 1$, then $v_p(a_0) \ge \inf_{n \ge 1} v_p(a_n) - v_p(k) - 2$.

(3) If $2 \mid (k+j)$ and $\chi_2 \ne 1$, then $v_p(a_0) \ge \inf_{n \ge 1} v_p(a_n) - 1/2^{m-3}$, $2^m$ the conductor of $\chi$.

Proof If (1) or (2) is wrong, we may assume v p (f — 1) > v p (k) + 2. Then 
f pr e JX{T 0 (p\ N) 9 p r k) satisfies v p (f pr — 1) > v p (k) + 2 + r, or 

v P Mf P ')) < inf v p (a„(f p ')) - v p (p r k) - 2, 

n> 1 

which contradicts Corollary C. 

To prove (3), we begin by showing v p (a 0 ) > inf„> j v p (a n ) — 1. First 
observe that 


$k(x) = S Z xfa) 5 * VMflW, n), x, k). 

n=l 

Indeed, if the contrary held, 

UxY = I?-1 L-i- V* e -*(W. AT), p r fc), 

which implies (pf e Jt(T(N), t) for some even t > 4. Then / = G, \ V — 
<t>r e Jt(r(N), t) and v p (a 0 (f)) < inf„> l v p (a n (f)) - v p (t) - 2, a contradic¬ 
tion to Corollary C. Now if g e Jt(r i (p r , N), x, k) is such that 
v p (a 0 ) < inf„>! v p (a„) - 1, we may take k > 4 and v p (g - 1) > 1. By 
Theorem 3 of [1], v p (L( 1 — k, x)) > 0, so 

$k(x) = (G k (x )/2 -Uf-k, x)g/ 2)~ e .#(T t (p', AT), k), 

violating the previous observation. 

To show that in fact v p (a 0 ) > inf v p (a n ) — l/p m ~ 3 , suppose 
/ g ^(r i (p r , N), x> k), v p (f - 1) > l/p m " 3 with x of conductor p m , m> 3. 
Then - 1) > 1 with f pm ~ 3 e JX(T t (p\ N), x pm ~\ P m ~ 3 k), which is 

impossible because (x pm \ i= 1- 

It should be possible to replace the conclusion of Theorem 15B(1) with $v_p(a_0) \ge \inf v_p(a_n) - 1$, completing the analogy with 15A(1). However, we have not been able to prove this if $\chi$ has conductor 8. Theorem 15A has the important consequence that if $\chi$ has conductor $p^m$ with $m > 1$ ($p \ne 2$), then $v_p(a_0) \ge \inf_{n \ge 1} v_p(a_n) - \lambda$ with $\lambda$ independent of $k$. This phenomenon is related to the fact that modular forms on $\Gamma_0(p^r, N)$ with such a character need not be p-adic modular forms of level $N$.

The next theorem, an analogue of Corollary D, is the final payoff of our theory of forms on $\Gamma_0(p^r, N)$. It provides the necessary power to establish the existence of p-adic abelian L-functions.

Theorem 16 Let 

/ <0 = X OnV e (p r , N), x (i) , ki), i = 1,2, .... 

11 = 0 

If p 2, suppose each x (l) = co Ui) x , with x cl fixed character of (Z/p r_1 Z). If 
p ----- 2, suppose each x (i) = X> a fixed character of (Z/p r Z ) x , and write j\ = 0 for 
each i. Finally , suppose that the a ( „ l) tend uniformly toward a n for n > 1, and the 
(K , k t + ji) approach a limit in X , nonzero if x = 1 or p = 2. Then the aff 
approach a limit a 0 . 

Proof After replacing each / (0 by we may assume 

/ (I) € JH k (T j (p r , AT), x, k ( ) for a fixed character The a ( „ i} still converge uni¬ 
formly for n > 1, and the /c, approach a limit in X, nonzero if ^ = 1 or p = 2. 
The proof is now the same as that of Corollary D, with Theorem 15 replac¬ 
ing Corollary C. 

We now take as our object of study p-adic abelian L-functions. Henceforth, let $K$ be any totally real number field, $L$ any abelian extension of $K$, and $G$ the Galois group of $L$ over $K$. If $\chi$ is a character of $G$, there exists an ideal $\mathfrak{f}$ of $K$, called the conductor of $\chi$, such that $\chi$ induces a character on the ray class group $R_{\mathfrak{f}} = I_{\mathfrak{f}}/P_{\mathfrak{f}}$. Here $I_{\mathfrak{f}}$ is the set of ideals of $K$ prime to $\mathfrak{f}$, and

$$P_{\mathfrak{f}} = \{(\alpha) \in I_{\mathfrak{f}} \mid \alpha \in K,\ \alpha \gg 0,\ \alpha \equiv 1 \bmod \mathfrak{f}\}.$$

Conversely, by class field theory, if $\mathfrak{f}$ is any $K$-ideal and $\chi$ a character of $R_{\mathfrak{f}}$, then $\chi$ comes from a character of some abelian extension of $K$.

If $\tilde{\mathfrak{f}}$ is the least common multiple of $\mathfrak{f}$ and $(p)$, then a fortiori $\chi$ induces a character of $R_{\tilde{\mathfrak{f}}}$. For $s > 1$, an abelian L-function is defined by $L_{\tilde{\mathfrak{f}}}(\chi, s) = \sum_{\mathfrak{a} \in I_{\tilde{\mathfrak{f}}}} \chi(\mathfrak{a})\, N\mathfrak{a}^{-s}$, where the summation is over all integral ideals in $I_{\tilde{\mathfrak{f}}}$ and $N$ denotes the norm from $K$ to $\mathbf{Q}$. $L_{\tilde{\mathfrak{f}}}(\chi, s)$, or simply $L(\chi, s)$, has an analytic continuation to the whole plane, with a possible pole at $s = 1$, and has a functional equation. As observed by Klingen [5], the values of $L(\chi, s)$ at negative integers $s$ are contained in the same finite extension of $\mathbf{Q}$ as the values of $\chi$. It therefore is meaningful to define a “p-adic abelian L-function” from $\mathbf{Z}_p$ to $\mathbf{Q}_p$ by p-adic interpolation. In order to be more precise, let $\theta$ be the character of $R_{\tilde{\mathfrak{f}}}$ defined by $\omega(N\mathfrak{a})$ and discussed below. Let $d = [K(\zeta) : K]$ where $\zeta = \zeta_p$ if $p \ne 2$ and $\zeta = \zeta_4$ if $p = 2$.
Theorem 17 Let $K$ be a totally real number field, $L$ an abelian extension of $K$, and $\chi$ a character of $\mathrm{Gal}(L/K)$. Then there exists a unique continuous function $L_p(\chi, s)$ from $\mathbf{Z}_p$ to $\mathbf{Q}_p$, possibly undefined at $s = 1$, such that $L_p(\chi, 1 - n) = L(\chi\theta^{-n}, 1 - n)$ for $n \in \mathbf{N}$. In particular, at any negative integer $s \equiv 1 \bmod d$, $L_p(\chi, s) = L(\chi, s)$.

Proo/ Let us first consider the character 0, following [2]. Denoting the 
abelian extension L(£) of K by L 0 , define a character, also written 0, of 
G 0 = Gal(L 0 /X) as follows: If peG 0 , choose an ideal re/j such that 
p = (r, L 0 /K) and let 0(p) = co(Nr). This is well defined because upon res¬ 
tricting to Q((), (r, L 0 /K) = p = (r', L 0 /K) implies (Nr, Q(Q/Q) = 
(Nr', Q(C)/Q), which implies a>(Nr) = a>(N r'). Since this new character of G 0 
clearly induces 0 on f and (p)|f, 0 is a ray class character of R. Moreover, 
(r, K(Q/K) d = 1 so (Nr, Q(C)/Q) d = 1 and 9(p) d = co(Nx) d = 1. Thus 
L p (x , 1 - n) = L(x9~ n , 1 - n) for all n e N gives L p (/, 1 - n) = L(x, 1 - n) 
for n = 0 mod d, i.e., the second assertion of the theorem is a special case of 
the first. Finally, for use below, note that if o is any embedding of L 0 into C 
and y is complex conjugation, then 0(<r" 1 yo) = — 1 because o' 1 yo(Q = £” L 

Since there is a natural map Gal(Lo /X) -► Gal(L/K), / can be viewed as a 
character on G 0 without changing the induced character on R f or the func¬ 
tion Lr(x, 5 ). First, suppose that x is not real, that is, for some embedding o 
of L 0 into C, x(c” = — 1. Whether n is even or odd, the T-factor in the 
functional equation of L(x9 ~ n , s) corresponding to o then has a pole at 
1 — n, and L(x9 ~ n , 1 — n) is always zero. Hence, if x is not real, L p (x , s) = 0 
satisfies the conditions of the theorem. And whether or not ^ is real, L p (x , s) 
is unique because the negative integers are dense in Z p . 

Now suppose that x Is real, so the generalized conductor of x contains no 
infinite primes. Then x(( a )) = 1 f° r a e K, a = 1 mod f; and /0 _ "((a)) = 
(sgn Naj" for such an a and any ne N. If (/) = Z n | /> 0, then x9~ n 
induces a Dirichlet character modulo f also denoted x9 ~ n , in a natural 
fashion. Moreover, by a theorem of Kloosterman-Siegel [10], 

W") - LOrT", 1 - *) 

+ 2 1 £ I Z (a)(Nof- V e ^(r^/), z0“". tfc) 

« = 1 jc, a 

if n and k have the same parity. Here t = [K : Q] and the inner summation is 
over pairs (x, a) with a e / , x e b“ 1 a, x > 0, and Tr x = n. Writing/ = p r N 
with pfN, x9 n induces a character of (Z/p r Z) x , and 
G k K (x9- n )eJt(r 1 (p',N\x9-\Tk). 

For $\chi$ real, first suppose $p \ne 2$. Given $s \in \mathbf{Z}_p$, $s \ne 0$, choose a sequence of positive integers $k_i$ approaching $s$ such that $(p - 1) \mid k_i$. As $i \to \infty$, the nonconstant terms of the $G^K_{k_i}(\chi)$ uniformly approach limits, specifically,

Z x(a)W(°)'‘ , “ 1 -*• Z x(ayo(Na)-\Nay-K 

Tr x~n Tr x = n 

Therefore, by Theorem 16, the constant terms L(x, 1 — k t ) also approach a 
limit, call it L p (x, 1 — s). Theorem 15 shows L p (x , 1 — s) is a continuous 
function of 5 because if i; p (fc| 2) — k\ l) ) is large, so is 

inf v p (a n (G k £ { } (x) — G^ ( ^X^jm 2 )-*!* 1 )))* 

n> 1 

Thus it only remains to show L p (x, 1 - n) = L(x0 _w , 1 - rc) for n 6 N. If 
ki-+n with (p- l)|fc„ then X xM^a)* 1 " 1 Z x( a ) 0 ( a )“ \Na} n ~ 1 = 
£ X0 _ "(a)iV(a) n_ Since iV(x^ K ) — (x) T for xeZ, the character induced by 
0on (Z/p'Z) x is co\ Hence 

/i = Gk(x) - G" K (x0-")£ T(ki - w) K) e ^(p', AT), x, t/c,), 

and applying Theorem 15 to f as i -► oo gives the desired result. 

Finally, suppose x is real and p = 2. Given a nonzero s e Z p , choose 
k t -+ s, the k t necessarily even if s e 2Z 2 and necessarily odd if s £ 2Z 2 . By 
the same reasoning as above, the constant terms of G^(x), respectively 
G*(x0 _1 ), approach a limit if s e 2Z 2 , respectively s £ 2Z 2 . Moreover, as 
above, if this limit is denoted L p (x, 1 - s), then L p (x, 1 — n) = 
L(xO~ n , 1 — n). L p (x, 1 — n) is continuous on 2Z 2 — {0} and on Z 2 — 2Z 2 , so 
it is indeed continuous on all of Z 2 - {0}. 

Theorem 18 If the Dirichlet character induced by $\chi$ has conductor divisible by $p$, then $L_p(\chi, s)$ is defined at $s = 1$ ($p \ne 2$).

Proof If the p-conductor part of the induced character is $\omega^j \chi_2^{(p)}$, then either $j \ne 0$ or $\chi_2^{(p)} \ne 1$. Theorem 15A(1) or 15A(3) may then be used to establish the convergence of $L(\chi, 1 - k_i)$, $k_i \to 0$, as above.

Theorem 18 should remain true under the weaker hypothesis that the Dirichlet character induced by $\chi$ is nontrivial. Indeed, $L_p(\chi, s)$ is probably defined at $s = 1$ if and only if $\chi$ itself is nontrivial, but the proof of this no doubt requires more advanced techniques.
A Characterization of the Line-Hyperplane Design of a Projective Space and Some Extremal Theorems for Matroid Designs †

Combinatorial characterizations of some incidence structures obtained from projective spaces and affine spaces over finite fields are given. Also some extremal theorems for matroid designs are proved.


1. Introduction 

An incidence structure is a triple $(X, \mathcal{B}, I)$, where $X$ and $\mathcal{B}$ are finite sets and $I \subseteq X \times \mathcal{B}$. Elements of $X$ are called treatments, while elements of $\mathcal{B}$ are called blocks. If $(x, B) \in I$, we say that $x$ is incident with $B$, and denote it by $x \in B$. We shall follow the usual notations of incidence structures, and often consider a block as the set of treatments incident with it. If $D$ and $D'$ are isomorphic incidence structures, we write $D \cong D'$. For any $B \in \mathcal{B}$, we shall

† This research was supported in part by NSF grant MPS 75-08231 and ONR contract N00014-67-A-0232-0016.
denote by $|B|$ the number of treatments incident with it. Incidence structures are also called designs; for various definitions connected with incidence structures, see Dembowski [4].

An incidence structure $D = (X, \mathcal{B}, I)$ is said to be a BIBD $(v, b, r, k, \lambda)$ iff (if and only if) it satisfies the following conditions:

(a) $|X| = v$, $|\mathcal{B}| = b$.

(b) $|B| = k$ for all $B \in \mathcal{B}$.

(c) Each treatment is incident with exactly $r$ blocks.

(d) Every pair of treatments is incident with exactly $\lambda$ blocks.

Parameters $(v, b, r, k, \lambda)$ satisfy the equations

$$\lambda(v - 1) = r(k - 1) \tag{1.1}$$

$$bk = rv \tag{1.2}$$

$$b \ge v \quad \text{(Fisher’s inequality)}. \tag{1.3}$$

A BIBD $(v, b, r, k, \lambda)$ with $v = b$ and hence $r = k$ is called an SBIBD $(v, k, \lambda)$. For further results on BIBDs, see [4].
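Conditions (a)-(d) and relations (1.1)-(1.3) can be checked mechanically on a small example. The following is a minimal sketch in Python, not part of the original text; the function names `bibd_parameters` and `satisfies_relations`, and the choice of the Fano plane PG(2, 2) as test case, are our own illustrative assumptions.

```python
from itertools import combinations


def bibd_parameters(points, blocks):
    """Return (v, b, r, k, lam) if (points, blocks) satisfies (a)-(d), else None."""
    v, b = len(points), len(blocks)
    sizes = {len(B) for B in blocks}                          # condition (b)
    reps = {sum(x in B for B in blocks) for x in points}      # condition (c)
    pair_counts = {sum(x in B and y in B for B in blocks)     # condition (d)
                   for x, y in combinations(points, 2)}
    if len(sizes) != 1 or len(reps) != 1 or len(pair_counts) != 1:
        return None
    return v, b, reps.pop(), sizes.pop(), pair_counts.pop()


def satisfies_relations(v, b, r, k, lam):
    """Check relations (1.1), (1.2) and Fisher's inequality (1.3)."""
    return lam * (v - 1) == r * (k - 1) and b * k == r * v and b >= v


# Fano plane: translates of the difference set {0, 1, 3} modulo 7.
fano_points = list(range(7))
fano_blocks = [frozenset({i % 7, (i + 1) % 7, (i + 3) % 7}) for i in range(7)]

params = bibd_parameters(fano_points, fano_blocks)
print(params, satisfies_relations(*params))
```

Since $v = b$ here, the Fano plane is an SBIBD in the sense just defined.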

An incidence structure $G = (X, \mathcal{E}, I)$ is called a simple graph iff $|B| = 2$ for each $B \in \mathcal{E}$ and, if $B_1, B_2 \in \mathcal{E}$, $B_1 \ne B_2$, then $B_1$ and $B_2$ are distinct treatment sets. Treatments of a graph are also called vertices, while blocks are called edges. Two vertices $x$ and $y$ are said to be adjacent iff there exists an edge incident with both $x$ and $y$. A simple graph $G = (X, \mathcal{E}, I)$ is said to be a strongly regular graph $G(v, n_1, p^1_{11}, p^2_{11})$ [2], iff it satisfies the following conditions:

(A) $|X| = v$.

(B) Each vertex is adjacent to exactly $n_1$ vertices.

(C) If $x, y \in X$, $x \ne y$, there are exactly $p^1_{11}$ or $p^2_{11}$ vertices $z$ such that $z$ is adjacent to both $x$ and $y$ according as $x$ and $y$ are adjacent or not adjacent.

Let $X$ be a set of $v$ elements and $G = G(v, n_1, p^1_{11}, p^2_{11})$ be a strongly regular graph with vertex set $X$. An incidence structure $D = (X, \mathcal{B}, I)$ is called a PBIBD $(v, b, r, k, \lambda_1, \lambda_2)$ with association graph $G(v, n_1, p^1_{11}, p^2_{11})$ iff the following conditions are satisfied:

(i) $|X| = v$, $|\mathcal{B}| = b$.

(ii) $|B| = k$ for each $B \in \mathcal{B}$.

(iii) Each treatment is incident with exactly $r$ blocks.

(iv) Any two distinct treatments $x$ and $y$ occur together in exactly $\lambda_1$ or $\lambda_2$ blocks of $D$ according as $x$ and $y$ are adjacent or not adjacent in $G$.

We shall denote the association graph of a PBIBD $D$ by $G(D)$. For various examples and results on PBIBDs, see [4]. We shall denote by PG(d, q) a projective space of dimension $d$ over a finite field GF(q), while by AG(d, q) we shall denote a $d$-dimensional affine space over GF(q).

For $d > m > l \ge 0$, define an incidence structure $P_{l,m}(d, q)$ as follows. Treatments of $P_{l,m}(d, q)$ are all $l$-dimensional subspaces of PG(d, q), and blocks are all $m$-dimensional subspaces of PG(d, q). The incidence relation is given by containment. $A_{l,m}(d, q)$ is similarly defined by taking AG(d, q) instead of PG(d, q). When no confusion arises we shall also write $P_{l,m}$ for $P_{l,m}(d, q)$ and $A_{l,m}$ for $A_{l,m}(d, q)$. The following results are well known and can be proved easily by using common properties of projective and affine spaces:


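(a) $P_{0,m}(d, q)$ and $A_{0,m}(d, q)$ are BIBDs for all $d > m > 0$.

(b) If $d \ge 3$, $P_{1,d-1}(d, q)$ is a PBIBD $(v, b, r, k, \lambda_1, \lambda_2)$, where

$$v = \frac{(q^{d+1} - 1)(q^d - 1)}{(q^2 - 1)(q - 1)}, \qquad b = \frac{q^{d+1} - 1}{q - 1}, \qquad r = \frac{q^{d-1} - 1}{q - 1}, \tag{1.4}$$

$$k = \frac{(q^d - 1)(q^{d-1} - 1)}{(q^2 - 1)(q - 1)}, \qquad \lambda_1 = \frac{q^{d-2} - 1}{q - 1}, \qquad \lambda_2 = \frac{q^{d-3} - 1}{q - 1}. \tag{1.5}$$

$G(P_{1,d-1})$ is a strongly regular graph $(v, n_1, p^1_{11}, p^2_{11})$ where

$$n_1 = (q + 1)\,q\,\frac{q^{d-1} - 1}{q - 1}, \qquad p^1_{11} = \frac{q^d - 1}{q - 1} - 2 + q^2, \qquad p^2_{11} = (q + 1)^2. \tag{1.6}$$

Any two vertices in $G(P_{1,d-1})$ are joined by an edge iff they are intersecting lines in PG(d, q).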
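The parameters of $P_{1,d-1}(d, q)$ and of its association graph must satisfy some standard counting identities: $bk = vr$, the pair count $r(k - 1) = n_1\lambda_1 + (v - 1 - n_1)\lambda_2$, and the strongly regular graph identity $n_1(n_1 - 1 - p^1_{11}) = (v - 1 - n_1)p^2_{11}$. The sketch below, which is ours and not part of the original text, evaluates the closed forms and tests these identities; the helper names `points`, `line_hyperplane_parameters` and `consistent` are hypothetical.

```python
def points(n, q):
    """(q^n - 1)/(q - 1): the number of points of PG(n - 1, q); 0 when n = 0."""
    return (q**n - 1) // (q - 1)


def line_hyperplane_parameters(d, q):
    """Closed-form parameters of the line-hyperplane design and its graph."""
    design = dict(
        v=points(d + 1, q) * points(d, q) // (q + 1),   # lines of PG(d, q)
        b=points(d + 1, q),                             # hyperplanes
        r=points(d - 1, q),                             # hyperplanes on a line
        k=points(d, q) * points(d - 1, q) // (q + 1),   # lines in a hyperplane
        lam1=points(d - 2, q),                          # intersecting lines
        lam2=points(d - 3, q),                          # skew lines
    )
    graph = dict(
        n1=(q + 1) * q * points(d - 1, q),
        p1=points(d, q) - 2 + q * q,
        p2=(q + 1) ** 2,
    )
    return design, graph


def consistent(d, q):
    """Check bk = vr, the PBIBD pair count, and the graph identity."""
    D, G = line_hyperplane_parameters(d, q)
    v, n1 = D["v"], G["n1"]
    return (
        D["b"] * D["k"] == v * D["r"]
        and D["r"] * (D["k"] - 1) == n1 * D["lam1"] + (v - 1 - n1) * D["lam2"]
        and n1 * (n1 - 1 - G["p1"]) == (v - 1 - n1) * G["p2"]
    )


print(line_hyperplane_parameters(3, 2))
print(all(consistent(d, q) for d in range(3, 9) for q in (2, 3, 4, 5, 7)))
```

These identities are necessary conditions only; they do not by themselves show that a design with such parameters is isomorphic to $P_{1,d-1}(d, q)$.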

By “replacing” a desarguesian plane by a nondesarguesian plane in PG(d, q) or AG(d, q) one can construct BIBDs with the same parameters as those of $P_{0,m}$ or $A_{0,m}$, but nonisomorphic to $P_{0,m}$ or $A_{0,m}$, respectively (see, for example, Mavron [8]). Dembowski and Wagner [5] proved a characterization theorem for $P_{0,d-1}(d, q)$ for $d \ge 3$. In Section 3 we prove the following characterization theorem for $P_{1,d-1}(d, q)$ for $d > 6$.


Theorem 1 Let $d$ and $q$ be positive integers, $d > 6$, $q > 1$. Suppose $D_1$ is a PBIBD $(v, b, r, k, \lambda_1, \lambda_2)$, where parameters are given by (1.5), and $G(D_1)$ has parameters $(v, n_1, p^1_{11}, p^2_{11})$ given by Eqs. (1.6); then $q$ is a prime power, $D_1 \cong P_{1,d-1}(d, q)$ and $G(D_1) \cong G(P_{1,d-1})$.

We define a combinatorial geometry (or matroid) by the “hyperplane axioms.” An incidence structure $D = (X, \mathcal{H}, I)$ is said to be a combinatorial geometry iff it satisfies the following conditions:

(a) Given $H_1, H_2 \in \mathcal{H}$, $H_1 \ne H_2$, $H_1$ is not a proper subset of $H_2$.

(b) Given $H_1, H_2 \in \mathcal{H}$, $H_1 \ne H_2$, and $x \in X$, there exists a block $H_3 \in \mathcal{H}$ s.t. (such that) $(H_1 \cap H_2) \cup \{x\} \subseteq H_3$.

(c) For each $x \in X$, $\bigcap H = \{x\}$, where the intersection is taken over all blocks $H$ s.t. $x \in H$.

Treatments of a combinatorial geometry are called points, while blocks 
are called hyperplanes. Combinatorial geometries have been studied in 
detail, and for various results on combinatorial geometry, we refer to Crapo 
and Rota [3]. We shall follow the notation of Crapo and Rota [3]. 

A combinatorial geometry $D = (X, \mathcal{H}, I)$ is called a geometric design iff $D$ is also a BIBD. A regular geometric design is a combinatorial geometry in which all flats of rank $i$ have the same cardinality $m_i$. It can be easily seen that a regular geometric design is indeed a geometric design [12]. Geometric designs have been studied by Edmonds, Young, and Murti [12]; in particular, they have given many examples of such designs. We list a few well-known examples of regular geometric designs.

(a) A $t$-$(v, k, \lambda)$ design [4] with $\lambda = 1$ is a regular geometric design of rank $(t + 1)$ s.t.

$$m_i = i \ \text{ for } 0 \le i < t \quad \text{and} \quad m_t = k.$$

(b) $P_{0,m}$ is a regular geometric design of rank $m + 1$ s.t.

$$m_i = \frac{q^i - 1}{q - 1} \quad \text{for } 0 \le i \le m < d.$$

(c) $A_{0,m}$ is a regular geometric design of rank $m + 1$ s.t.

$$m_i = q^{i-1} \quad \text{for } 0 < i \le m < d.$$

We note that $m_0 = 0$, $m_1 = 1$; and $m_2$ is the size of a line in any regular geometric design. Clearly $m_2 \ge 2$.

In Section 4 we shall prove the following results. 

Theorem 2 If $D = (X, \mathcal{H}, I)$ is a regular geometric design of rank $n \ge 4$, then:

(a) $m_i - m_{i-1} \ge (m_2 - 1)(m_{i-1} - m_{i-2})$ for $n > i \ge 3$.

(b) Equality holds in (a) for any $i$ iff $D \cong P_{0,n}(d, q)$ for some prime power $q$ and $d \ge n + 1$, or $m_2 = 2$ and $D$ is a 3-design.

Geometric designs of rank 3 are precisely BIBDs with $\lambda = 1$. We also study geometric designs of rank 4 in Section 4, and show that all such designs are regular and that examples (a)-(c) given above are the extreme cases of certain inequalities to be satisfied.
Theorem 3 If $D = (X, \mathcal{H}, I)$ is a regular geometric design of rank $n \ge 4$, then:

(a) $m_i - m_{i-1} \ge (m_2 - 1)^{i-j}(m_j - m_{j-1})$ for $n > i > j \ge 2$.

(b) Equality holds in (a) for any $i$ iff $D \cong P_{0,n}(d, q)$ for some prime power $q$ and $d \ge n + 1$, or $m_2 = 2$ and $D$ is a 3-design.

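Theorem 4 Let $D = (X, \mathcal{H}, I)$ be a geometric design $(v, b, r, k, \lambda)$ of rank 4; then:

(a) $v \equiv k \bmod (\lambda - 1)$.

(b) $\lambda k - 2(\lambda - 1) \ge v \ge \lambda k - \tfrac{1}{2}(\lambda - 1) - \tfrac{1}{2}(\lambda - 1)\sqrt{4k - 3}$.

(c) $v = \lambda k - 2(\lambda - 1)$ iff $D$ is a 3-$(v, k, 1)$-design.

(d) $v = \lambda k - \tfrac{1}{2}(\lambda - 1) - \tfrac{1}{2}(\lambda - 1)\sqrt{4k - 3}$ iff $k = q^2 + q + 1$ for some prime power $q$ and $D \cong P_{0,2}(n, q)$, where $n$ is given by $v = (q^{n+1} - 1)/(q - 1)$, or $k = 3$ and $D$ is a 3-$(v, k, 1)$-design.

(e) If $v > \lambda k - \tfrac{1}{2}(\lambda - 1) - \tfrac{1}{2}(\lambda - 1)\sqrt{4k - 3}$, then $v \ge \lambda k - (\lambda - 1)\sqrt{k}$, and for $k \ge 16$ equality holds iff $D \cong A_{0,2}(n, q)$, where $k = q^2$ for some prime power $q$ and $v = q^n$.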

2. Preliminaries 

In this section we shall state various known results on characterizing 
geometries, which we shall be using in the next section. 

Let $D$ be a BIBD $(v, b, r, k, \lambda)$ with $\lambda = 1$. It is well known that the dual $D'$ is a PBIBD $(b, v, k, r, \lambda_1, \lambda_2)$ with $\lambda_1 = 1$, $\lambda_2 = 0$ s.t. $G(D')$ is a strongly regular graph with parameters $(b, k(r - 1), r - 2 + (k - 1)^2, k^2)$: each block contains $k$ treatments, each of which lies on $r - 1$ further blocks. Any two vertices $B_1, B_2$ are joined in $G(D')$ if and only if $|B_1 \cap B_2| = 1$ in $D$.
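The graph $G(D')$ can be computed directly for a small design with $\lambda = 1$. The following sketch, which is ours and not part of the original text, builds the lines of the affine plane AG(2, q) for a prime $q$ and reads off the parameters of the graph whose vertices are the blocks, two blocks being joined when they share exactly one treatment. The names `affine_plane_lines` and `block_graph_parameters` are hypothetical.

```python
from itertools import combinations, product


def affine_plane_lines(q):
    """Lines of AG(2, q), q prime, as frozensets of points (x, y) mod q."""
    lines = set()
    for a, c in product(range(q), repeat=2):
        lines.add(frozenset((x, (a * x + c) % q) for x in range(q)))  # y = ax + c
    for c in range(q):
        lines.add(frozenset((c, y) for y in range(q)))                # x = c
    return sorted(lines, key=sorted)


def block_graph_parameters(blocks):
    """(b, n1, p1_11, p2_11) of the block graph, or None if not strongly regular."""
    adj = {B: {C for C in blocks if C != B and len(B & C) == 1} for B in blocks}
    degrees = {len(nbrs) for nbrs in adj.values()}
    common_adjacent, common_nonadjacent = set(), set()
    for B, C in combinations(blocks, 2):
        target = common_adjacent if C in adj[B] else common_nonadjacent
        target.add(len(adj[B] & adj[C]))
    if len(degrees) != 1 or len(common_adjacent) != 1 or len(common_nonadjacent) != 1:
        return None
    return len(blocks), degrees.pop(), common_adjacent.pop(), common_nonadjacent.pop()


# AG(2, 3) is a BIBD (9, 12, 4, 3, 1), so r = 4 and k = 3.
r, k = 4, 3
print(block_graph_parameters(affine_plane_lines(3)))
print((12, k * (r - 1), r - 2 + (k - 1) ** 2, k ** 2))
```

For AG(2, 3) both printed tuples agree, with valency $k(r - 1) = 9$.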

The following result is a particular case of a well-known theorem of Bose 
on partial geometries [2]. 

Proposition 5 If $G = (V, \mathcal{E}, I)$ is any strongly regular graph with parameters $(b, k(r - 1), r - 2 + (k - 1)^2, k^2)$, where $b, r, k$ are integers s.t.

$$r > \tfrac{1}{2}\,[k(k - 1) + k(k + 1)(k^2 - 2k + 2)],$$

then there exists a unique BIBD $D(v, b, r, k, \lambda)$ with $\lambda = 1$, $v = r(k - 1) + 1$ s.t. $V$ is the set of blocks of $D$ and $G(D') = G$.

Let $D = (X, \mathcal{B}, I)$ be an incidence structure. For $x, y \in X$, $x \ne y$, the line $xy$ is defined by

$$xy = \bigcap B,$$

where the intersection is taken over all blocks $B$ s.t. $x, y \in B$.

The following well-known theorem due to Dembowski and Wagner [5] gives a characterization of the incidence structure $P_{0,d-1}(d, q)$ obtained from PG(d, q).

Proposition 6 If $D$ is a symmetric BIBD $(v, b, r, k, \lambda)$, $\lambda > 1$, s.t. its dual $D'$ is a combinatorial geometry, then $D \cong P_{0,d-1}(d, q)$ for some positive integer $d$ and prime power $q$.

We note that the dual $D'$ of a BIBD $D(v, b, r, k, \lambda)$ is a combinatorial geometry iff every line of $D$ meets every block of $D$.

Let $D = (X, \mathcal{B}, I)$ be an incidence structure. For each block $B$ of $D$ we define a new incidence structure $D_B$ as follows. Treatments of $D_B$ are treatments of $D$ incident with $B$. Blocks of $D_B$ are the lines of $D$ contained in $B$. The following results on regular geometric designs of rank 4 are well known. The first one is essentially the definition of a projective space. The second is due to Buekenhout [1].

Proposition 7 If $D$ is a regular geometric design of rank 4 s.t. $m_2 \ge 3$ and $D_B$ is a projective plane for all blocks $B$ of $D$, then $D \cong P_{0,2}(d, q)$ for some $d$ and prime power $q$.

Proposition 8 If $D$ is a regular geometric design of rank 4 s.t. $m_2 \ge 4$ and $D_B$ is an affine plane for all blocks $B$ of $D$, then $D \cong A_{0,2}(d, q)$ for some $d$ and prime power $q$.


3. Proof of Theorem 1 

Let $d$ and $q$ be positive integers, $d > 6$. Throughout this section we will assume that $D_1 = (X, \mathcal{B}, I)$ is a PBIBD $(v, b, r, k, \lambda_1, \lambda_2)$ with association graph $G_1 = G(v, n_1, p^1_{11}, p^2_{11})$, where these parameters are given by Eqs. (1.5) and (1.6). The proof of Theorem 1 is essentially based on Propositions 5 and 6. We first prove the following simple lemma.

Lemma 9 The dual of $D_1$ is a BIBD with parameters $(b, v, k, r, \lambda)$ where

$$\lambda = \frac{(q^{d-1} - 1)(q^{d-2} - 1)}{(q^2 - 1)(q - 1)}.$$

Proof We have only to prove that any two blocks of $D_1$ intersect in $\lambda$ treatments. The following two equations are obtained easily by counting the occurrences of treatments and pairs of treatments in blocks of $D_1$:

$$\sum_{B_1, B_2 \in \mathcal{B}} |B_1 \cap B_2| = b(r - 1)k = vr(r - 1) \tag{3.1}$$

and

$$\sum_{B_1, B_2 \in \mathcal{B}} |B_1 \cap B_2|\,(|B_1 \cap B_2| - 1) = v n_1 \lambda_1(\lambda_1 - 1) + v(v - 1 - n_1)\lambda_2(\lambda_2 - 1), \tag{3.2}$$

where the summations in (3.1) and (3.2) are over all pairs of distinct blocks $B_1$ and $B_2$ of $D_1$. Using (3.1) and (3.2) we can derive the following equation easily:

$$\sum_{B_1, B_2 \in \mathcal{B}} (|B_1 \cap B_2| - \lambda)^2 = 0. \tag{3.3}$$

From (3.3) it follows that $|B_1 \cap B_2| = \lambda$ for all blocks $B_1$ and $B_2$ of $D_1$, $B_1 \ne B_2$. This completes the proof.
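The step from (3.1) and (3.2) to (3.3) is a short expansion: summing over ordered pairs of distinct blocks, $\sum (|B_1 \cap B_2| - \lambda)^2 = S_2 + S_1 - 2\lambda S_1 + \lambda^2 b(b - 1)$, where $S_1$ and $S_2$ are the right-hand sides of (3.1) and (3.2). The sketch below, which is ours and not part of the original text, evaluates this expression from the closed-form parameters; the helper names are hypothetical.

```python
def points(n, q):
    """(q^n - 1)/(q - 1); 0 when n = 0."""
    return (q**n - 1) // (q - 1)


def intersection_square_sum(d, q):
    """Value of sum (|B1 ∩ B2| - lam)^2 implied by (3.1) and (3.2)."""
    v = points(d + 1, q) * points(d, q) // (q + 1)
    b = points(d + 1, q)
    r = points(d - 1, q)
    lam1, lam2 = points(d - 2, q), points(d - 3, q)
    n1 = (q + 1) * q * points(d - 1, q)
    lam = points(d - 1, q) * points(d - 2, q) // (q + 1)    # Lemma 9

    s1 = v * r * (r - 1)                                     # (3.1)
    s2 = (v * n1 * lam1 * (lam1 - 1)
          + v * (v - 1 - n1) * lam2 * (lam2 - 1))            # (3.2)
    ordered_pairs = b * (b - 1)
    # sum x^2 = s2 + s1, sum x = s1, number of terms = b(b - 1)
    return (s2 + s1) - 2 * lam * s1 + lam * lam * ordered_pairs


print([intersection_square_sum(d, 2) for d in range(3, 9)])
```

A zero value for each admissible pair $(d, q)$ is exactly what (3.3) asserts.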

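Consider the PBIBD $D_1 = (X, \mathcal{B}, I)$. Let $l \in X$ and $B \in \mathcal{B}$. Let $t_l$ denote the number of treatments $l_1 \in X$ s.t. $l_1 \in B$ and the vertices $l$ and $l_1$ are joined in $G_1$. Suppose $l \in B$; then counting the occurrences of treatments of $B$ in the remaining blocks of $D_1$ containing $l$ and using Lemma 9 we have

$$(\lambda_1 - 1)\,t_l + (\lambda_2 - 1)(k - 1 - t_l) = (\lambda - 1)(r - 1).$$

Since $\lambda_1 - \lambda_2 = q^{d-3} \ne 0$, solving the above equation for $t_l$, we get

$$t_l = (q + 1)\,q\,\frac{q^{d-2} - 1}{q - 1} \quad \text{for } l \in B.$$

Similarly, if $l \notin B$, we can obtain the following equation

$$\lambda_1 t_l + \lambda_2 (k - t_l) = \lambda r$$

and hence

$$t_l = q\,\frac{q^{d-2} - 1}{q - 1} + 1 = \frac{q^{d-1} - 1}{q - 1} \quad \text{for } l \notin B.$$

Thus we have proved

Lemma 10 For $l \in X$ and $B \in \mathcal{B}$, if $t_l$ is as defined above, then $t_l$ is independent of the choice of $B$ and is given by

$$t_l = (q + 1)\,q\,\frac{q^{d-2} - 1}{q - 1} \quad \text{for } l \in B$$

and

$$t_l = \frac{q^{d-1} - 1}{q - 1} \quad \text{for } l \notin B.$$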

Lemma 11 Let $d > 6$. There exists a unique BIBD $D_2 = (X_2, X, I_2)$ with parameters

$$\left( \frac{q^{d+1} - 1}{q - 1},\ \frac{(q^{d+1} - 1)(q^d - 1)}{(q^2 - 1)(q - 1)},\ \frac{q^d - 1}{q - 1},\ q + 1,\ 1 \right)$$

s.t. $G(D_2') \cong G_1$.

Proof Since $d > 6$, the conditions of Proposition 5 are satisfied for the graph $G_1$ and the result follows from the same.
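The hypothesis of Proposition 5 is a numerical inequality, so its use here can be checked directly with $k = q + 1$ and $r = (q^d - 1)/(q - 1)$, the values for which the graph parameters of Proposition 5 coincide with (1.6). The sketch below is ours and not part of the original text; `bose_condition_holds` and `condition_for_lines` are hypothetical names.

```python
def bose_condition_holds(r, k):
    """r > (1/2)[k(k - 1) + k(k + 1)(k^2 - 2k + 2)], tested without fractions."""
    return 2 * r > k * (k - 1) + k * (k + 1) * (k * k - 2 * k + 2)


def condition_for_lines(d, q):
    """The condition for the graph on the lines of PG(d, q)."""
    r = (q**d - 1) // (q - 1)   # lines through a point
    k = q + 1                   # points on a line
    return bose_condition_holds(r, k)


print(all(condition_for_lines(d, q)
          for d in range(6, 10) for q in (2, 3, 4, 5, 7, 8, 9)))
print(condition_for_lines(4, 2))
```

The second line shows that the inequality genuinely fails for small $d$, so some lower bound on $d$ is needed for this argument.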
From now on we shall assume that $d > 6$ and $D_2$ is the design defined by the above lemma. We note that blocks of $D_2$ are the treatments of $D_1$. We define a new incidence structure $D_3 = (X_2, \mathcal{B}, I_3)$ as follows. Treatments of $D_2$ are the treatments of $D_3$, while the blocks of $D_1$ are the blocks of $D_3$. A treatment $x$ of $D_3$ is incident with the block $B$ iff there exists $l \in X$ s.t. $x \in l$ in $D_2$ and $l \in B$ in $D_1$.

We proceed to compute the block size $k_3$ in $D_3$. Let $B$ be a block of $D_3$. For each treatment $x \in B$ in $D_3$, define $a_x$ to be the number of treatments $l$ of $D_1$ s.t. $l \in B$ in $D_1$ and $x \in l$ in $D_2$. We first show that $a_x = (q^{d-1} - 1)/(q - 1)$ for each $x \in B$ in $D_3$. Let $l$ be any treatment of $D_1$ s.t. $l \in B$ in $D_1$. Then using Lemma 10 and the definition of $D_2$, etc., we have

$$\sum_{x \in l} a_x = (q + 1)\,q\,\frac{q^{d-2} - 1}{q - 1} + (q + 1), \tag{3.4}$$

where the summation is over all the $q + 1$ treatments $x \in l$ in $D_2$.

Also if for each $x \in l$ in $D_2$ there is some treatment $l'$ of $D_1$, $l' \notin B$ in $D_1$ s.t. $x \in l'$ in $D_2$, then again using Lemma 10, etc., we have

$$a_x \le q\,\frac{q^{d-2} - 1}{q - 1} + 1 = \frac{q^{d-1} - 1}{q - 1}. \tag{3.5}$$

Using (3.4) and (3.5) it follows that

$$a_x = q\,\frac{q^{d-2} - 1}{q - 1} + 1 \quad \text{for all } x \in l \tag{3.6}$$

or there is some $x \in l$ in $D_2$ s.t. $a_x = (q^d - 1)/(q - 1)$.

Suppose there exists $x \in B$ in $D_3$ s.t.

$$a_x = \frac{q^d - 1}{q - 1}. \tag{3.7}$$

We next show that for all $z \ne x$, $z \in B$, $a_z \le q(q^{d-2} - 1)/(q - 1)$.

Now if $z$ is any treatment of $D_3$, $z \in B$, $z \ne x$, then using (3.7), if $l'$ is the unique block of $D_2$ containing $x$ and $z$, $l' \in B$ in $D_1$. Hence using (3.4) for $l'$ we have

$$a_z \le (q + 1)\,q\,\frac{q^{d-2} - 1}{q - 1} + (q + 1) - \frac{q^d - 1}{q - 1} = q\,\frac{q^{d-2} - 1}{q - 1} \tag{3.8}$$

for all $z \in B$ in $D_3$, $z \ne x$.

Now since the block size in $D_1$ is $(q^d - 1)(q^{d-1} - 1)/((q^2 - 1)(q - 1))$, there is at least one treatment $l'$ of $D_1$ s.t. $l' \in B$ in $D_1$ but $x \notin l'$ in $D_2$. Equation (3.6) for $l'$ and Eq. (3.8) give a contradiction. Hence

$$a_x = q\,\frac{q^{d-2} - 1}{q - 1} + 1 = \frac{q^{d-1} - 1}{q - 1} \tag{3.9}$$

for all $x \in B$ in $D_3$.

Now using (3.9), by counting the number of elements in the set $\{(x, l) \mid x \in l \text{ in } D_2,\ l \in B \text{ in } D_1\}$ we have

$$k_3\,\frac{q^{d-1} - 1}{q - 1} = \frac{(q^d - 1)(q^{d-1} - 1)}{(q - 1)(q^2 - 1)}\,(q + 1).$$

Hence

$$k_3 = (q + 1)\,\frac{q^d - 1}{q^2 - 1} = \frac{q^d - 1}{q - 1} \tag{3.10}$$

for all blocks $B$ of $D_3$. We can now prove the following lemma.

Lemma 12 Let $d > 6$. Then $D_3$ is an SBIBD

$$\left( \frac{q^{d+1} - 1}{q - 1},\ \frac{q^d - 1}{q - 1},\ \frac{q^{d-1} - 1}{q - 1} \right).$$

Proof It is obvious that the number of treatments or the number of blocks in $D_3$ is

$$v = b = \frac{q^{d+1} - 1}{q - 1}.$$

Using (3.10) it follows that each block of $D_3$ is of size $k_3$, where

$$k_3 = \frac{q^d - 1}{q - 1}.$$

Hence we have only to prove that any two treatments of $D_3$ occur in exactly $(q^{d-1} - 1)/(q - 1)$ blocks. Suppose $x_1$ and $x_2$ are two treatments of $D_3$. Then there is a unique treatment $l$ of $D_1$ s.t. $x_1, x_2 \in l$ in $D_2$. Now $l$ occurs in exactly $r = (q^{d-1} - 1)/(q - 1)$ blocks of $D_1$. Hence $x_1, x_2$ occur in at least $(q^{d-1} - 1)/(q - 1)$ blocks of $D_3$. Counting the elements of the set

$$\{(y, z, B) \mid y, z \in X_2,\ y, z \in B \text{ in } D_3,\ B \in \mathcal{B}\}$$

we easily derive that in $D_3$ any two distinct treatments are incident with exactly $(q^{d-1} - 1)/(q - 1)$ blocks. This completes the proof.
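The double count leading to (3.10) and the parameters of Lemma 12 can be checked numerically. In the sketch below, which is ours and not part of the original text, $k_3$ is recovered from the count of pairs $(x, l)$ and the symmetric-design relation $\lambda(v - 1) = k(k - 1)$ is then tested; the helper names are hypothetical.

```python
def points(n, q):
    """(q^n - 1)/(q - 1); 0 when n = 0."""
    return (q**n - 1) // (q - 1)


def d3_parameters(d, q):
    """(v, k3, lam) of D_3, with k3 obtained from the double count."""
    k1 = points(d, q) * points(d - 1, q) // (q + 1)   # block size of D_1
    a_x = points(d - 1, q)                            # value of a_x from (3.9)
    pairs = k1 * (q + 1)                              # pairs (x, l) with x in l, l in B
    assert pairs % a_x == 0
    k3 = pairs // a_x
    return points(d + 1, q), k3, points(d - 1, q)


def is_symmetric_design(v, k, lam):
    """Relation (1.1) with r = k, i.e. lam (v - 1) = k (k - 1)."""
    return lam * (v - 1) == k * (k - 1)


print(d3_parameters(6, 2), is_symmetric_design(*d3_parameters(6, 2)))
```

For $d = 6$, $q = 2$ this gives the parameters $(127, 63, 31)$ of the point-hyperplane design of PG(6, 2).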

We can now complete the proof of Theorem 1. We first note that the blocks of the BIBD $D_2$ correspond to the lines of $D_3$. For $l \in X$, let $(l)$ denote the elements of $X_2$ incident with $l$ in $D_2$. It is easily proved that for $x, y \in (l)$, $(l)$ is the line generated by $x$ and $y$ in $D_3$. For the sake of brevity, $l$ will also denote the line of $D_3$. Using Lemma 10, given any block $B$ of $D_3$ and a line $l$ of $D_3$, there exists at least one treatment $m$ of $D_1$ s.t. $m \in B$ and $l$ and $m$ are incident with a common block of $D_1$. Therefore any line and any block of $D_3$ intersect. Thus the conditions of Proposition 6 are satisfied. Hence $D_3 \cong P_{0,d-1}(d, q)$. It follows that $D_1 \cong P_{1,d-1}(d, q)$ and $G_1 \cong G(P_{1,d-1})$ and $q$ is a prime power.


4. Geometric Designs 


In this section we shall study regular geometric designs. We shall also 
study geometric designs of rank 4. We shall first prove Theorem 2. 

Let $D = (X, \mathcal{H}, I)$ be a regular geometric design of rank $d \ge 3$. Let $d > j \ge 3$. Let $P \subseteq X$ be any $(j - 3)$-flat of $D$, and let $Q \supseteq P$ be a $j$-flat of $D$. We define an incidence structure $D_{P,Q}$ as follows. Treatments of $D_{P,Q}$ are all $(j - 2)$-flats that contain $P$ and are contained in $Q$. Blocks are all $(j - 1)$-flats that contain $P$ and are contained in $Q$. Incidence is given by containment. The following result can be proved easily. In fact the usual proof for projective spaces can be extended to regular geometric designs [12].

Proposition 13 $D_{P,Q}$ is a BIBD with parameters

$$\left( \frac{m_j - m_{j-3}}{m_{j-2} - m_{j-3}},\ \frac{(m_j - m_{j-3})(m_j - m_{j-2})}{(m_{j-1} - m_{j-3})(m_{j-1} - m_{j-2})},\ \frac{m_j - m_{j-2}}{m_{j-1} - m_{j-2}},\ \frac{m_{j-1} - m_{j-3}}{m_{j-2} - m_{j-3}},\ 1 \right).$$

Proof of Theorem 2 Let $D = (X, \mathcal{H}, I)$ be a regular geometric design as defined above. Using (1.2) and (1.3) for $D_{P,Q}$ we have

$$\frac{m_j - m_{j-2}}{m_{j-1} - m_{j-2}} \ge \frac{m_{j-1} - m_{j-3}}{m_{j-2} - m_{j-3}}.$$

Hence

$$\frac{m_j - m_{j-1}}{m_{j-1} - m_{j-2}} \ge \frac{m_{j-1} - m_{j-2}}{m_{j-2} - m_{j-3}} \quad \text{for all } d > j \ge 3. \tag{4.1}$$

Using (4.1) it follows that

$$\frac{m_i - m_{i-1}}{m_{i-1} - m_{i-2}} \ge m_2 - 1 \quad \text{for all } i \ge 3;$$

hence $m_i - m_{i-1} \ge (m_2 - 1)(m_{i-1} - m_{i-2})$ for $d > i \ge 3$. This proves statement (a). Now suppose

$$m_i - m_{i-1} = (m_2 - 1)(m_{i-1} - m_{i-2}) \quad \text{for some } i.$$

Then we will have equality in (4.1) for all $j$, $i \ge j \ge 3$. In particular

$$m_3 - m_2 = (m_2 - 1)^2.$$

Hence

$$m_3 = (m_2 - 1)^2 + (m_2 - 1) + 1.$$

Thus if $Q$ is a rank 3 flat of $D$ and $\emptyset$ denotes the 0-flat, then $D_{Q,\emptyset}$ will be a projective plane for all $Q$. Now let $D_1$ be an incidence structure defined as follows. Treatments of $D_1$ are the treatments of $D$. Blocks of $D_1$ are all the 3-flats of $D$. It is obvious that $D_1$ is a regular geometric design of rank 4. If $m_2 \ge 3$, then $D_1$ satisfies the conditions of Proposition 7 and hence $D_1 \cong P_{0,2}(d, q)$ for some $d$ and prime power $q$. Now it follows easily that $D \cong P_{0,n}(d, q)$ and $d \ge n + 1$. If $m_2 = 2$, then any three points determine a unique 3-flat of $D$, and hence clearly they are incident with exactly

$$\frac{(m_n - m_3)(m_n - m_4) \cdots (m_n - m_{n-2})}{(m_{n-1} - m_3) \cdots (m_{n-1} - m_{n-2})}$$

blocks of $D$. Thus $D$ is a 3-design. This completes the proof of the theorem. Theorem 3 is the obvious inductive extension of Theorem 2.
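The inequality of Theorem 2(a) is easy to tabulate for the two standard families of flat sizes listed in the Introduction, $m_i = (q^i - 1)/(q - 1)$ and $m_i = q^{i-1}$. The sketch below is ours and not part of the original text; the function names are hypothetical, and the lists are simply the sequences $m_0, m_1, \ldots$ truncated at a chosen length.

```python
def projective_flat_sizes(length, q):
    """m_i = (q^i - 1)/(q - 1) for i = 0, ..., length - 1."""
    return [(q**i - 1) // (q - 1) for i in range(length)]


def affine_flat_sizes(length, q):
    """m_0 = 0 and m_i = q^(i-1) for i = 1, ..., length - 1."""
    return [0] + [q ** (i - 1) for i in range(1, length)]


def theorem2_sides(m):
    """Pairs (m_i - m_{i-1}, (m_2 - 1)(m_{i-1} - m_{i-2})) for i >= 3."""
    return [(m[i] - m[i - 1], (m[2] - 1) * (m[i - 1] - m[i - 2]))
            for i in range(3, len(m))]


print(theorem2_sides(projective_flat_sizes(6, 3)))  # equality in every position
print(theorem2_sides(affine_flat_sizes(6, 3)))      # strict inequality
```

In the projective family both sides equal $q^{i-1}$, while in the affine family the left side exceeds the right by the factor $q/(q - 1)$.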

For the rest of this section we shall assume that $D = (X, \mathcal{H}, I)$ is a geometric design of rank 4. Let the parameters of $D$ be $(v, b, r, k, \lambda)$. Since rank $D = 4$, $\lambda > 1$.

Lemma 14 All geometric designs of rank 4 are regular.

Proof Let $D$ be a geometric design of rank 4, with parameters $(v, b, r, k, \lambda)$. Let $x, y \in X$, $x \ne y$. Let $l$ be the unique 2-flat containing $x, y$, i.e., the line $xy$. Now from the properties of combinatorial geometries it follows that given $z \in X$, $z \notin l$, there is a unique block $B$ of $D$ s.t. $l \cup \{z\} \subseteq B$. Hence counting the number of elements in the set $\{(z, B) \mid z \in X - l,\ z \in B,\ l \subseteq B\}$, we have

$$\lambda(k - |l|) = v - |l|.$$

Hence

$$|l| = \frac{\lambda k - v}{\lambda - 1}.$$

Thus the cardinality of a line $l$ is independent of the choice of the line $l$. The proof of Lemma 14 is now complete.

We now prove the final result of this paper. 

Proof of Theorem 4 Using Lemma 14, $D$ is a regular geometric design with

$$m_2 = \frac{\lambda k - v}{\lambda - 1} \quad \text{and} \quad m_3 = k. \tag{4.2}$$

Since $m_2$ is an integer, $v \equiv k \bmod (\lambda - 1)$. Also since $m_2 \ge 2$,

$$\lambda k - v \ge 2(\lambda - 1).$$

Hence

$$v \le \lambda k - 2(\lambda - 1).$$

Suppose $v = \lambda k - 2(\lambda - 1)$. Then $m_2 = 2$ and clearly $D$ is a 3-$(v, k, 1)$-design. Thus we have proved (c). We now prove the remaining parts of Theorem 4. Using Theorem 2(a) for $i = 3$, we have

$$k \ge m_2(m_2 - 1) + 1. \tag{4.3}$$

Hence using (4.2) we have

$$v^2 - v(2\lambda k - (\lambda - 1)) + \lambda^2 k^2 - \lambda(\lambda - 1)k - (\lambda - 1)^2(k - 1) \le 0.$$

Hence

$$v \ge \lambda k - \frac{\lambda - 1}{2} - \frac{1}{2}\sqrt{(2\lambda k - (\lambda - 1))^2 - 4\left(\lambda^2 k^2 - \lambda(\lambda - 1)k - (\lambda - 1)^2(k - 1)\right)},$$

i.e.,

$$v \ge \lambda k - \frac{\lambda - 1}{2} - \frac{\lambda - 1}{2}\sqrt{4k - 3}.$$

This proves (b). Now suppose $v = \lambda k - \tfrac{1}{2}(\lambda - 1) - \tfrac{1}{2}(\lambda - 1)\sqrt{4k - 3}$; then we will have equality in (4.3) and the statement (d) follows from Theorem 2(b).

Now suppose $v > \lambda k - \tfrac{1}{2}(\lambda - 1) - \tfrac{1}{2}(\lambda - 1)\sqrt{4k - 3}$. Hence

$$k > m_2(m_2 - 1) + 1.$$

Thus if $P$ is any 3-flat of $D$ and $\emptyset$ denotes the 0-flat, then $D_{P,\emptyset}$ is not an SBIBD. Parameters of $D_{P,\emptyset}$ are

$$\left( m_3,\ \frac{m_3(m_3 - 1)}{m_2(m_2 - 1)},\ \frac{m_3 - 1}{m_2 - 1},\ m_2,\ 1 \right).$$

Hence

$$\frac{m_3 - 1}{m_2 - 1} \ge m_2 + 1,$$

i.e., $k \ge m_2^2$. Hence using (4.2) we have

$$v^2 - 2\lambda k v - (\lambda - 1)^2 k + \lambda^2 k^2 \le 0. \tag{4.4}$$

Hence

$$v \ge \lambda k - \tfrac{1}{2}\sqrt{4\lambda^2 k^2 - 4\left[\lambda^2 k^2 - (\lambda - 1)^2 k\right]},$$

i.e.,

$$v \ge \lambda k - (\lambda - 1)\sqrt{k}.$$

If $v = \lambda k - (\lambda - 1)\sqrt{k}$ and $k \ge 16$, then we will have equality in (4.4) and $m_2 \ge 4$. Hence all designs $D_{P,\emptyset}$ are affine planes, and the statement (e) follows from Proposition 8. This completes the proof of the theorem.
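The two extreme cases can be checked on the point-plane designs of PG(n, q) and AG(n, q). The sketch below is ours and not part of the original text. It assumes the standard counts $v = (q^{n+1} - 1)/(q - 1)$, $k = q^2 + q + 1$ in the projective case and $v = q^n$, $k = q^2$ in the affine case, with $\lambda = (q^{n-1} - 1)/(q - 1)$ planes through a line in both; the names `rank4_examples` and `line_size` are hypothetical.

```python
from math import isqrt


def points(n, q):
    """(q^n - 1)/(q - 1); 0 when n = 0."""
    return (q**n - 1) // (q - 1)


def rank4_examples(n, q):
    """(v, k, lam) for the point-plane designs of PG(n, q) and AG(n, q)."""
    lam = points(n - 1, q)            # planes through a fixed line
    return {
        "projective": (points(n + 1, q), q * q + q + 1, lam),
        "affine": (q**n, q * q, lam),
    }


def line_size(v, k, lam):
    """m_2 from (4.2); integrality is Theorem 4(a)."""
    numerator, denominator = lam * k - v, lam - 1
    assert numerator % denominator == 0
    return numerator // denominator


v, k, lam = rank4_examples(3, 3)["projective"]
root = isqrt(4 * k - 3)
print(line_size(v, k, lam), 2 * v == 2 * lam * k - (lam - 1) * (1 + root))

v, k, lam = rank4_examples(3, 3)["affine"]
print(line_size(v, k, lam), v == lam * k - (lam - 1) * isqrt(k))
```

In the projective case $4k - 3 = (2q + 1)^2$, so the square root in (b) is an integer and the lower bound is attained; in the affine case the bound of (e) is attained with $\sqrt{k} = q$.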

A natural question arises: are there geometric designs of rank 4 different from those given by cases (c)-(e) of Theorem 4? We note that using examples of Steiner triple systems due to Hall [6], one can construct one such geometric design of rank 4 (see also Teirlinck [11]). This example also shows that the hypothesis $k \ge 16$ in statement (e) is necessary. An interesting problem will be to classify all geometric designs of rank 4.
On the Behavior of Ideal Classes in
Cyclic Unramified Extensions of Prime
Degree

If K/F is a cyclic unramified extension of number fields of prime degree p and $\Psi_{F\to K}\colon \mathrm{Cl}(F) \to \mathrm{Cl}(K)$
is the ideal class mapping, then $\#(\mathrm{Ker}(\Psi_{F\to K}))$ is bounded above by $p^{r+w+1}$ where r is the
rank of $E_F$ as a Z-module, w = 1 if F contains the pth roots of unity, and w = 0 otherwise. Noticing
that this bound goes to infinity along with [K : Q], we ask the question: Can one construct a
sequence of extensions K/F for which $\#(\mathrm{Ker}(\Psi_{F\to K}))$ goes to infinity along with [K : Q]? This
question is settled by the following theorem, which we prove here:

Let L be any number field and p a fixed rational prime. Then there exists a cyclic extension F of
degree p over L which in turn possesses a cyclic unramified extension K of degree p over F
satisfying: $p^r$ divides $\#(H^1(\mathrm{Gal}(K/F), E_K))$, where r is the rank of $E_L$ as a Z-module.


1. Introduction 

Let F be a number field and K a finite Galois extension of F. The
extension K/F is said to be unramified if no prime of F (finite or infinite)
ramifies in K/F. If $\mathfrak{A}$ is a nonprincipal ideal of F, then we say that $\mathfrak{A}$
capitulates in K/F if the K-ideal $\mathfrak{A}\mathfrak{O}_K$ is principal. There is a natural
homomorphism $\Psi_{F\to K}\colon \mathrm{Cl}(F) \to \mathrm{Cl}(K)$ given by $\mathrm{Cl}(\mathfrak{A}) \mapsto \mathrm{Cl}(\mathfrak{A}\mathfrak{O}_K)$, where Cl(F)
denotes the ideal class group of F and $\mathrm{Cl}(\mathfrak{A})$ denotes the ideal class to which
$\mathfrak{A}$ belongs. Thus $\mathfrak{A}$ capitulates in K/F if and only if $\mathrm{Cl}(\mathfrak{A})$ belongs to
$\mathrm{Ker}(\Psi_{F\to K})$. When this occurs, we also say that $\mathrm{Cl}(\mathfrak{A})$ capitulates. As long as
K and F are understood, we shall write $\Psi$ instead of $\Psi_{F\to K}$.

† Present address: 6300 Golden Hook, Columbia, Maryland.

There are two basic tools for analyzing $\mathrm{Ker}(\Psi)$. The first concerns a
group theoretic homomorphism called the transfer. The second, and the one
we shall use, is the following theorem due to Iwasawa [2]:

Theorem 1 Let K/F be an unramified Galois extension of number fields
with Galois group G, and let $E_K$ denote the group of units of K. Then
$\mathrm{Ker}(\Psi) \cong H^1(G, E_K)$.

Now let K/F be a cyclic unramified extension of prime degree p. There is
a natural lower bound on $\#(\mathrm{Ker}(\Psi))$, namely for such an extension, p
divides $\#(\mathrm{Ker}(\Psi))$. This result is known as Hilbert's theorem 94 and follows
from Theorem 1 and the well-known fact that

$$\#(H^0(G, E_K)) / \#(H^1(G, E_K)) = [K : F]^{-1}$$

(see for instance Lang [3, p. 192]). There is also a natural upper bound on
$\#(\mathrm{Ker}(\Psi))$.

Lemma 2 $\#(\mathrm{Ker}(\Psi))$ divides $p^{r+w+1}$ where r is the rank of $E_F$ as a
Z-module, w = 1 if F contains the pth roots of 1, and w = 0 otherwise.

Proof By Theorem 1, $\mathrm{Ker}(\Psi) \cong H^1(G, E_K)$ where G = Gal(K/F). Also
by the Herbrand quotient, $\#(H^1(G, E_K)) = p\,\#(H^0(G, E_K))$. Now
$H^0(G, E_K) = E_F / N_{K/F}(E_K)$ and $E_F^p \subseteq N_{K/F}(E_K)$. Moreover, as a Z-module,
$E_F \cong W \oplus \mathbf{Z}^r$ where W is the cyclic group of roots of unity of F. Thus $E_F / E_F^p$
has order $p^{r+w}$ where w = 1 if p divides #(W) and w = 0 otherwise. Thus
$\#(H^0(G, E_K))$ divides $p^{r+w}$ so that $\#(\mathrm{Ker}(\Psi))$ divides $p^{r+w+1}$. ∎

Now r is equal to $r_1 + r_2 - 1$ where $r_1$ is the number of real embeddings
of F into C and $r_2$ is the number of conjugate pairs of complex embeddings
of F into C. Thus we see that the upper bound of Lemma 2 goes to infinity
along with [F : Q]. If $N\colon \mathrm{Cl}(K) \to \mathrm{Cl}(F)$ denotes the homomorphism
induced by the norm of ideals, then

$$N(\Psi(\mathrm{Cl}(\mathfrak{A}))) = \mathrm{Cl}(\mathfrak{A})^p$$

so that $\mathrm{Ker}(\Psi)$ is an elementary abelian p-group. Let t be its dimension as a
vector space over Z/pZ. We consider the following question: Can we find
cyclic unramified extensions K/F of prime degree p for which t is as large as
we wish? That the answer to this question is yes follows from Theorem 1 and
the next theorem, whose proof will absorb the remainder of this paper.
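
As a small illustration of how the exponent in Lemma 2 depends on the signature of F, the following sketch computes $r = r_1 + r_2 - 1$ and $r + w + 1$. The function names are hypothetical and the inputs are assumed to be supplied by the reader; nothing here computes class groups.

```python
def unit_rank(r1, r2):
    """Rank r = r1 + r2 - 1 of the unit group (r1 real embeddings, r2 complex pairs)."""
    return r1 + r2 - 1

def kernel_bound_exponent(r1, r2, has_pth_roots):
    """Exponent r + w + 1 of the upper bound p^(r+w+1) in Lemma 2."""
    w = 1 if has_pth_roots else 0
    return unit_rank(r1, r2) + w + 1

# A real quadratic field with p = 2: r = 1, w = 1 (it contains -1), exponent 3.
print(kernel_bound_exponent(2, 0, True))
```

For Q and for imaginary quadratic fields the rank is 0, so the bound grows only through fields with many embeddings.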
Theorem 3 Let L be any number field and p a fixed rational prime. Then
there exists a cyclic extension F of degree p over L which in turn possesses a
cyclic unramified extension K of degree p over F satisfying:

$p^r$ divides $\#(H^1(\mathrm{Gal}(K/F), E_K))$,
where r is the rank of $E_L$ as a Z-module.


2. Proof of Theorem 3 

We will assume throughout that L is neither Q nor any imaginary quadratic
field since in these cases r = 0 and the theorem is trivial. To prove the
theorem, we shall construct a cyclic extension $K_1$ of L of degree p for which

$$N_{K_1/L}(E_{K_1}) = E_L^p \tag{1}$$

and

$K_1/L$ is unramified at the infinite primes of L and there exists a finite prime of
L ramified in $K_1/L$. (2)

Then we shall construct another cyclic extension $K_2$ of L of degree p such
that

If $\mathfrak{p}$ is a finite prime of L ramified in $K_1/L$, then $\mathfrak{p}$ is unramified in $K_2/L$. (3)

The compositum $K = K_1 K_2$ is such that

$$N_{K/K_1}(E_K) = E_{K_1}^p. \tag{4}$$

Before actually constructing $K_1$ and $K_2$, let us show that their existence
suffices to prove the theorem.

First, $K_1$ and $K_2$ are linearly disjoint over L. For there exists a prime $\mathfrak{p}$ of
L ramified in $K_1/L$ and such a $\mathfrak{p}$ is unramified in $K_2/L$. Thus $\mathrm{Gal}(K/L) \cong
\mathbf{Z}/p\mathbf{Z} \oplus \mathbf{Z}/p\mathbf{Z}$ and so there exists a subfield F of K containing L with F
distinct from $K_1$ and $K_2$ and with [K : F] = [F : L] = p. Choose any such F.
We shall show that K/F satisfies the conclusions of the theorem.

It is easy to see that K/F is unramified, for suppose $\mathfrak{P}$ is a finite prime of
F ramified in K/F. If $\mathfrak{p}$ is the L-prime that $\mathfrak{P}$ divides, then $\mathfrak{p}$ ramifies in $K_1/L$
and in $K_2/L$, which is impossible by the construction of $K_2$. Moreover, the
infinite primes of F are unramified in K/F since the infinite primes of L are
unramified in $K_1/L$.
It remains to show that $p^r$ divides $\#(H^1(G, E_K))$ where G = Gal(K/F).

Let $W_K$ denote the group of roots of unity of K. By (1) and (4),
$N_{K/L}(E_K) = E_L^{p^2}$. It then follows that

$$E_K / W_K \cong E_L W_K / W_K \oplus ({}_{N_{K/L}}E_K) W_K / W_K$$

where ${}_{N_{K/L}}E_K$ is the group of units $\eta$ of K for which $N_{K/L}(\eta) = 1$ and the
direct sum is as Z[G]-modules. For if $\xi \in E_K$, then $N_{K/L}(\xi) = \eta^{p^2}$ where
$\eta \in E_L$ is unique modulo roots of unity and $\xi\eta^{-1} \in {}_{N_{K/L}}E_K$. Thus

$$H^0(G, E_K / W_K) \cong H^0(G, E_L W_K / W_K) \oplus H^0(G, ({}_{N_{K/L}}E_K) W_K / W_K).$$

We cannot say much about the second factor in this direct sum decomposition,
but since G acts trivially on $E_L W_K / W_K$ and $E_L W_K / W_K$ is a free Z-module
of rank r, $\#(H^0(G, E_L W_K / W_K)) = p^r$ and thus we see that $p^r$ divides
$\#(H^0(G, E_K / W_K))$. Then by the Herbrand quotient, $p^{r+1}$ divides
$\#(H^1(G, E_K / W_K))$. Now from the exact sequence of G-modules,

$$1 \to W_K \to E_K \xrightarrow{\ \pi\ } E_K / W_K \to 1,$$

we obtain an exact sequence of cohomology groups,

$$\cdots \to H^1(G, W_K) \to H^1(G, E_K) \xrightarrow{\ \pi^*\ } H^1(G, E_K / W_K) \xrightarrow{\ \delta\ } H^0(G, W_K) \to \cdots.$$

But $\#(H^0(G, W_K))$ divides $\#(W_K / W_K^p) = p^w$ where w = 1 if K contains the
pth roots of 1 and w = 0 otherwise. Hence the order of $\mathrm{Im}(\pi^*) = \mathrm{Ker}(\delta)$ is
divisible by $p^{r+1-w}$ so that $p^r$ divides $\#(H^1(G, E_K))$.

3. Construction of $K_1$ and $K_2$

To complete the proof, we must construct $K_1$ and $K_2$. We shall construct
$K_1$ first.

Let $1, \varepsilon_1, \varepsilon_2, \ldots, \varepsilon_s$ denote a system of coset representatives of $E_L / E_L^p$. Let
$L_1 = L(\zeta_p)$ where $\zeta_p$ is a primitive pth root of unity. Let us first observe that
an element $\alpha \in L$ belongs to $(L^\times)^p$ if and only if $\alpha \in (L_1^\times)^p$. This is obvious
since $[L_1 : L]$ is prime to p. For each i, let $M_i = L_1(\varepsilon_i^{1/p})$ where $\varepsilon_i^{1/p}$ denotes a
root of $x^p - \varepsilon_i$ in C. By our observation above and our choice of $\varepsilon_i$, we see
that $[M_i : L_1] = p$. Since $M_i / L_1$ is cyclic, the density of primes of $L_1$ that split
completely in $M_i$ is 1/p < 1. Thus there exist infinitely many $L_1$-primes that
do not split completely in $M_i$. Now choose distinct $L_1$-primes $\mathfrak{P}_i$ that are to
satisfy:

1. $\mathfrak{P}_i$ does not split completely in $M_i$; and

2. $\mathfrak{P}_i$ does not divide $2\mathfrak{O}_{L_1}$.
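Let $\mathfrak{p}_i$ denote the L-prime that $\mathfrak{P}_i$ divides.

Suppose $\varepsilon_i = \beta_i^p$ for some $\beta_i \in L_{1,\mathfrak{P}_i}^\times$; then $L_{1,\mathfrak{P}_i}(\varepsilon_i^{1/p}) = L_{1,\mathfrak{P}_i}$ so that $\mathfrak{P}_i$
splits completely in $M_i / L_1$, contrary to the choice of $\mathfrak{P}_i$. We conclude that
$\varepsilon_i \notin (L_{1,\mathfrak{P}_i}^\times)^p$. All the more so, $\varepsilon_i \notin (L_{\mathfrak{p}_i}^\times)^p$. Moreover, $\mathfrak{p}_i$ does not divide $2\mathfrak{O}_L$.

Now $L_{\mathfrak{p}_i}^\times / (L_{\mathfrak{p}_i}^\times)^p$ is a finite elementary abelian p-group. Since $\varepsilon_i \notin (L_{\mathfrak{p}_i}^\times)^p$, we
can choose a subgroup $H_i$ of $L_{\mathfrak{p}_i}^\times$ of index p not containing $\varepsilon_i$. By local class
field theory, there exists a cyclic extension $F_i$ of degree p over $L_{\mathfrak{p}_i}$ with
$N_{F_i / L_{\mathfrak{p}_i}}(F_i^\times) = H_i$. Note that $\varepsilon_i \in U_{\mathfrak{p}_i}$, the unit group of $L_{\mathfrak{p}_i}$, so that $U_{\mathfrak{p}_i} \not\subseteq H_i$
and hence $F_i / L_{\mathfrak{p}_i}$ is ramified. Let $\chi_{\mathfrak{p}_i}\colon L_{\mathfrak{p}_i}^\times \to T$ be a continuous local character
with $\mathrm{Ker}(\chi_{\mathfrak{p}_i}) = H_i$, where here T denotes the set of complex numbers of
absolute value 1. If $S_\infty$ denotes the set of real infinite primes of L, then for
each $\mathfrak{p}_\infty \in S_\infty$, let $\chi_{\mathfrak{p}_\infty}\colon \mathbf{R}^\times \to T$ be the trivial character. Now set
$S = \{\mathfrak{p}_i : 1 \le i \le s\} \cup S_\infty$ and apply a variant of Grunwald's existence theorem [1, p. 103,
Theorem 5]. Since no $\mathfrak{p}_i$ divides $2\mathfrak{O}_L$, $S_0 \not\subseteq S$ where $S_0 = \{\mathfrak{p} \mid \mathfrak{p} \text{ divides } 2\mathfrak{O}_L\}$.
Thus there exists a continuous global character $\chi\colon C_L \to T$ of order p (the $\chi_{\mathfrak{p}_i}$
have order p and the $\chi_{\mathfrak{p}_\infty}$ have order 1) whose restriction to $L_\mathfrak{p}^\times$ is $\chi_\mathfrak{p}$ for all
$\mathfrak{p} \in S$ (here $C_L$ denotes the idele class group of L and we view $L_{\mathfrak{p}_i}^\times$ embedded
in $C_L$ in the usual manner). Let $M = \mathrm{Ker}(\chi)$. Since $\chi$ is continuous and T is
Hausdorff, $\mathrm{Ker}(\chi)$ is closed. Moreover, $[C_L : M] = p$ since $\chi$ has order p and
$\mathrm{Im}(\chi) \subseteq T$ is cyclic. Thus M is an open subgroup of $C_L$ of finite index p. By
class field theory, there exists a cyclic extension $K_1$ of L of degree p such that
$N_{K_1/L}(C_{K_1}) = M$.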
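We shall now show that $K_1$ has the desired properties. If $\mathfrak{P}_i$ is a prime of
$K_1$ dividing $\mathfrak{p}_i$, then

$$N_{K_{1,\mathfrak{P}_i} / L_{\mathfrak{p}_i}}(K_{1,\mathfrak{P}_i}^\times) = M \cap L_{\mathfrak{p}_i}^\times = \mathrm{Ker}(\chi) \cap L_{\mathfrak{p}_i}^\times
= \mathrm{Ker}(\chi|_{L_{\mathfrak{p}_i}^\times}) = \mathrm{Ker}(\chi_{\mathfrak{p}_i}) = H_i,$$

and so $\varepsilon_i \notin N_{K_{1,\mathfrak{P}_i} / L_{\mathfrak{p}_i}}(K_{1,\mathfrak{P}_i}^\times)$. Thus $\varepsilon_i \notin N_{K_1/L}(K_1^\times)$ for $1 \le i \le s$ and (1)
follows. Also, if $\mathfrak{p}_\infty \in S_\infty$ and $\mathfrak{P}_\infty$ is a prime of $K_1$ above it, then
$N_{K_{1,\mathfrak{P}_\infty} / L_{\mathfrak{p}_\infty}}(K_{1,\mathfrak{P}_\infty}^\times) = L_{\mathfrak{p}_\infty}^\times$ so that $\mathfrak{p}_\infty$ is
unramified in $K_1/L$. In addition, by our original assumption on L, $s \ge 1$ and
so, as we observed during the construction, some finite prime of L ramifies in
$K_1/L$. Thus (2) is also satisfied.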

The construction of $K_2$ is similar to that of $K_1$ and we will leave out
some details at the points of similarity.

Let $1, \eta_1, \eta_2, \ldots, \eta_t$ denote a system of coset representatives of $E_{K_1} / E_{K_1}^p$.
For each i, let $K_{1,i} = K_1(\eta_i^{1/p})$ where $\eta_i^{1/p}$ denotes a fixed root of $x^p - \eta_i$ in C.
Choose distinct L-primes $\mathfrak{q}_i$ which are to satisfy:

1. $\mathfrak{q}_i$ splits completely in $K_1/L$;

2. $\mathfrak{q}_i$ does not divide $2\mathfrak{O}_L$; and

3. if $\mathfrak{Q}_i$ is a $K_1$-prime over $\mathfrak{q}_i$, then $\eta_i \notin (K_{1,\mathfrak{Q}_i}^\times)^p$.

To see that such $\mathfrak{q}_i$ exist, we first notice that there exist infinitely many
L-primes that split completely in $K_1(\zeta_p)/L$ but do not split completely in
$K_{1,i}(\zeta_p)/L$. Then by an argument similar to the one used in the construction
of $K_1$, we see that properties 1 and 3 are satisfied for such a $\mathfrak{q}_i$. See Fig. 1.



Fig. 1 


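Since $\mathfrak{q}_i$ splits completely in $K_1/L$, $K_{1,\mathfrak{Q}_i} = L_{\mathfrak{q}_i}$ and by 3, $\eta_i \notin (L_{\mathfrak{q}_i}^\times)^p$.
Again, there exists a cyclic local extension $M_i / L_{\mathfrak{q}_i}$ of prime degree p for which
$\eta_i \notin N_{M_i / L_{\mathfrak{q}_i}}(M_i^\times)$. Set

A = {L-primes $\mathfrak{p}$ : $\mathfrak{p}$ ramifies in $K_1/L$},

$S_0$ = {L-primes $\mathfrak{p}$ : $\mathfrak{p}$ divides $2\mathfrak{O}_L$},

$S = \{\mathfrak{q}_i : 1 \le i \le t\} \cup S_0 \cup A$.

As before, define continuous local characters $\chi_{\mathfrak{q}_i}\colon L_{\mathfrak{q}_i}^\times \to T$ with
$\mathrm{Ker}(\chi_{\mathfrak{q}_i}) = N_{M_i / L_{\mathfrak{q}_i}}(M_i^\times)$. In addition, for each $\mathfrak{p} \in A \cup S_0$, let $\chi_\mathfrak{p}\colon L_\mathfrak{p}^\times \to T$ be the trivial
character. Since $\prod_{\mathfrak{p} \in S_0} \chi_\mathfrak{p} = 1$, there exists a continuous global character
$\chi\colon C_L \to T$ of order p for which $\chi|_{L_\mathfrak{p}^\times} = \chi_\mathfrak{p}$ for all $\mathfrak{p} \in S$ (again see Artin and
Tate [1, p. 103, Theorem 5]). Let $K_2$ be the cyclic extension of L of degree p
determined by $\chi$. It remains to show that $K_2$ satisfies the desired properties.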

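Let $\mathfrak{Q}_i$ be a prime of $K_1$ over $\mathfrak{q}_i$, let $\mathfrak{P}_i$ be a prime of $K = K_1 K_2$ over $\mathfrak{Q}_i$,
and let $\mathfrak{p}_i'$ be the prime of $K_2$ that $\mathfrak{P}_i$ divides. By construction of $K_2$,
$\eta_i \notin N_{K_{2,\mathfrak{p}_i'} / L_{\mathfrak{q}_i}}(K_{2,\mathfrak{p}_i'}^\times)$. But $\mathfrak{q}_i$ splits completely in $K_1/L$ and so $\mathfrak{p}_i'$ splits
completely in $K/K_2$. Hence $L_{\mathfrak{q}_i} = K_{1,\mathfrak{Q}_i}$ and $K_{2,\mathfrak{p}_i'} = K_{\mathfrak{P}_i}$, and we see that
$\eta_i \notin N_{K_{\mathfrak{P}_i} / K_{1,\mathfrak{Q}_i}}(K_{\mathfrak{P}_i}^\times)$ for $1 \le i \le t$. It follows that $N_{K/K_1}(E_K) = E_{K_1}^p$ and so (4)
is satisfied. Let $\mathfrak{p} \in A \cup S_0$. If $\mathfrak{Y}$ is a $K_2$-prime above $\mathfrak{p}$, then
$N_{K_{2,\mathfrak{Y}} / L_\mathfrak{p}}(K_{2,\mathfrak{Y}}^\times) = \mathrm{Ker}(\chi) \cap L_\mathfrak{p}^\times = \mathrm{Ker}(\chi|_{L_\mathfrak{p}^\times}) = L_\mathfrak{p}^\times$ so that $L_\mathfrak{p} = K_{2,\mathfrak{Y}}$ and $\mathfrak{p}$
splits completely in $K_2/L$. In particular, $\mathfrak{p}$ is unramified in $K_2/L$. Thus (3) is
also satisfied. This completes the proof of Theorem 3. ∎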
1. Introduction

Suppose we are given N points in $U^k$, where k > 1 and $U^k$ is the unit
cube, consisting of points $\mathbf{y} = (y_1, \ldots, y_k)$ with $0 \le y_i < 1$ (i = 1, ..., k). For $\mathbf{x}$
in $U^k$, write $Z(\mathbf{x})$ for the number of points among the N given points that lie
in the box $0 \le y_i < x_i$ (i = 1, ..., k), and put

$$D(\mathbf{x}) = Z(\mathbf{x}) - N x_1 \cdots x_k.$$

The irregularity of the distribution of the given N points may be measured in
various ways by the behavior of the function $D(\mathbf{x})$. For example, one may
consider the $L^p$-norms

$$\|D\|_p = \left(\int |D(\mathbf{x})|^p \, d\mathbf{x}\right)^{1/p}.$$

(Here and throughout, except where noted, the integrals are over $U^k$.) Roth
[4] proved that for some $c_1(k) > 0$,

$$\|D\|_2 > c_1(k)(\log N)^{(k-1)/2}, \tag{1.1}$$

and Davenport [2] showed that for k = 2 this is best possible, except for the
value of the constant $c_1(2)$.
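
To make the definitions concrete, here is a minimal sketch that evaluates $D(\mathbf{x})$ for a finite point set and estimates $\|D\|_p$ by a midpoint rule. The function names and the grid resolution `cells` are assumptions made for illustration, the box is taken half-open as $0 \le y_i < x_i$, and the norm estimate is only a numerical approximation.

```python
from itertools import product
from math import prod

def local_discrepancy(points, x):
    """D(x) = Z(x) - N*x_1*...*x_k, with Z(x) counting points in the box 0 <= y_i < x_i."""
    n = len(points)
    z = sum(all(0 <= y_i < x_i for y_i, x_i in zip(y, x)) for y in points)
    return z - n * prod(x)

def lp_norm_estimate(points, p, cells=64):
    """Midpoint-rule estimate of ||D||_p over the unit cube (an approximation only)."""
    k = len(points[0])
    mids = [(j + 0.5) / cells for j in range(cells)]
    total = sum(abs(local_discrepancy(points, x)) ** p
                for x in product(mids, repeat=k))
    return (total / cells ** k) ** (1.0 / p)

pts = [(0.5, 0.5)]
print(local_discrepancy(pts, (0.75, 0.75)))   # one point counted, minus 1 * 0.5625
print(lp_norm_estimate(pts, 1), lp_norm_estimate(pts, 2))
```

Because the estimate is an average over grid points, it inherits the monotonicity in p that the text uses for the true norms.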

Theorem 1 For p > 1 there is a constant $c_1(k, p) > 0$ such that

$$\|D\|_p > c_1(k, p)(\log N)^{(k-1)/2}. \tag{1.2}$$

Since $\|D\|_p$ cannot decrease with p, this estimate for 1 < p < 2 is a
sharpening of (1.1). Moreover, it is best possible for k = 2 and $1 < p \le 2$.
Perhaps it is best possible, except for the value of $c_1(k, p)$, for every k > 1,
p > 1. We cannot give an estimate of the same strength for $\|D\|_1$. But with

$$E(\mathbf{x}) = D(\mathbf{x})(\log N)^{-(k-1)/2},$$

we have

Theorem 2 For $q > k - \tfrac{3}{2}$, there is a $c_2 = c_2(k, q) > 0$ such that

$$\int |c_2 E(\mathbf{x})| \, \bigl|\log |c_2 E(\mathbf{x})|\bigr|^q \, d\mathbf{x} > 1. \tag{1.3}$$

This contains Theorem 1: For pick $c_3 = c_3(q) > 0$ so small that
$z|\log z|^q < \tfrac{1}{2}$ for $0 < z < c_3$. Then the integral of $|c_2 E(\mathbf{x})| \, |\log |c_2 E(\mathbf{x})||^q$
over points $\mathbf{x}$ with $|c_2 E(\mathbf{x})| \ge c_3$ is $> \tfrac{1}{2}$. But in this domain $|c_2 E(\mathbf{x})|^p \ge
c_4 |c_2 E(\mathbf{x})| \, |\log |c_2 E(\mathbf{x})||^q$, where $c_4 > 0$ is the minimum of $z^{p-1}|\log z|^{-q}$
for z with $z \ge c_3$. Thus

$$\int |c_2 E(\mathbf{x})|^p \, d\mathbf{x} > \tfrac{1}{2} c_4,$$

and (1.2) follows.

Sobol [ 6 ] noted that 



c s(k) 

N ’ 


and in [3, 6] there is the statement that one can always give examples where
$\|D\|_1 < 5$. But Sobol gives such an example only for N = 1! In fact, we have

Theorem 3 For some $c_6 = c_6(k) > 0$ and for large N (say $N > e^{100}$),

$$\|D\|_1 > c_6 \log\log N / \log\log\log N.$$

The function on the right-hand side is the same as the one given by
Aardenne-Ehrenfest [1] in her estimate of $\|D\|_\infty$, i.e., the supremum of
$|D(\mathbf{x})|$. As far as I can see, this is a coincidence. It is possible that our
estimate of $\|D\|_1$ could be somewhat improved at the cost of a further
complication of the proof.

For the sake of completeness we mention that at present the best estimates
for $\|D\|_\infty$ are $\|D\|_\infty > c_1(k)(\log N)^{(k-1)/2}$ as a consequence of (1.1),
and $\|D\|_\infty > c_7 \log N$ for k = 2, according to [5].

The following question is open: Is there a function f(N), decreasing to 0,
such that for any C > 0 the measure of the set of $\mathbf{x}$ with $|D(\mathbf{x})| < C$ is
O(f(N))?

2. The Method 

Roth obtained his estimate by constructing an auxiliary function $F(\mathbf{x})$
with

$$\int F(\mathbf{x}) D(\mathbf{x}) \, d\mathbf{x} > c_8(k)(\log N)^{k-1} \tag{2.1}$$

(where $c_8(k)$ as well as all of our constants are positive) and

$$\int F^2(\mathbf{x}) \, d\mathbf{x} < c_9(k)(\log N)^{k-1}. \tag{2.2}$$

These two inequalities, together with Schwarz' inequality, yield (1.1). It may
be shown that the very same function $F(\mathbf{x})$ also has

$$\int F^{2m}(\mathbf{x}) \, d\mathbf{x} < c_9(k, m)(\log N)^{m(k-1)} \quad (m = 1, 2, \ldots). \tag{2.3}$$

This is rather surprising since $F(\mathbf{x})$ was constructed with only (2.1) and (2.2)
in mind. Now by (2.3),

$$\|F\|_{2m} < c_{10}(k, m)(\log N)^{(k-1)/2} \quad (m = 1, 2, \ldots),$$

and since $\|F\|_r$ cannot decrease as a function of r, we have

$$\|F\|_r < c_{11}(k, r)(\log N)^{(k-1)/2} \quad (r > 0). \tag{2.4}$$

Hölder's inequality yields

$$\int F(\mathbf{x}) D(\mathbf{x}) \, d\mathbf{x} \le \|F\|_r \, \|D\|_p,$$

provided p > 1, r > 1 with (1/p) + (1/r) = 1. This inequality, together with
(2.1), (2.4), gives (1.2).
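
Written out as one chain, with r = p/(p - 1) and the constants named as in (2.1) and (2.4), the deduction reads as follows. The quotient $c_8(k)/c_{11}(k, r)$ is just one admissible value for $c_1(k, p)$, shown here for illustration.

$$c_8(k)(\log N)^{k-1} < \int F(\mathbf{x}) D(\mathbf{x}) \, d\mathbf{x} \le \|F\|_r \, \|D\|_p < c_{11}(k, r)(\log N)^{(k-1)/2} \, \|D\|_p,$$

so that

$$\|D\|_p > \frac{c_8(k)}{c_{11}(k, r)} (\log N)^{(k-1)/2}.$$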

We shall use a slightly modified function $F(\mathbf{x})$ with (2.1) and with

$$\int F^{2m}(\mathbf{x}) \, d\mathbf{x} \le (2m)^{m(2k-3)}(\nu + 1)^{m(k-1)} \quad (m = 1, 2, \ldots), \tag{2.5}$$

where $\nu$ is the integer determined by

$$2N \le 2^\nu < 4N. \tag{2.6}$$

This explicit estimate will be needed for Theorem 2. (Roth's function $F(\mathbf{x})$
would give (2.5) with the exponent m(2k - 3) replaced by m(2k - 2), which
would yield (1.3) under the slightly more restrictive condition that
q > k - 1.)

In order to prove the more difficult Theorem 3, we shall need certain 
finite sums, rather than integrals over U k . For Theorems 1 and 2, we could 
also have used finite sums, which perhaps would have made the exposition 
more uniform. But the notation with integrals is slightly simpler, and by 
using integrals we stay closer to the established style, as presented by 
Kuipers and Niederreiter [3] or Roth [4]. 


3. Generalized Rademacher Functions 

Every x in $0 \le x < 1$ may uniquely be written as

$$x = \sum_{j=1}^{\infty} \beta_j(x) 2^{-j},$$

where the digits $\beta_j(x)$ are 0 or 1, and where the sequence of digits does not
end with 1, 1, .... The functions

$$R_r(x) = (-1)^{\beta_{r+1}(x)} \quad (r = 0, 1, \ldots)$$

are the Rademacher functions. (This notation is more convenient for us than
the more common $R_r(x) = (-1)^{\beta_r(x)}$.)
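
The digit convention can be stated in a few lines of code. This is a sketch under the indexing read off from the text, $R_r(x) = (-1)^{\beta_{r+1}(x)}$; the helper names are hypothetical.

```python
from math import floor

def digit(j, x):
    """j-th binary digit beta_j(x) of x in [0, 1), for j = 1, 2, ..."""
    return floor(x * 2 ** j) % 2

def rademacher(r, x):
    """R_r(x) = (-1)^beta_{r+1}(x), r = 0, 1, ..., in the indexing used here."""
    return -1 if digit(r + 1, x) else 1

# R_0 is +1 on [0, 1/2) and -1 on [1/2, 1); R_1 alternates on quarters.
print([rademacher(0, x) for x in (0.25, 0.75)])
print([rademacher(1, x) for x in (0.125, 0.375, 0.625, 0.875)])
```

With this indexing $R_r$ changes sign at the midpoint of every interval of length $2^{-r}$, which is what makes its integral over such an interval vanish.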

An r-interval will be an interval

$$m 2^{-r} \le x < (m + 1) 2^{-r},$$

where m is an integer in $0 \le m < 2^r$. An r-function, where r is a nonnegative
integer, is defined as a function f(x) in $0 \le x < 1$ such that in every
r-interval, either $f(x) = R_r(x)$ or $f(x) = -R_r(x)$. There are $2^r$ r-intervals, hence
$2^{2^r}$ r-functions. An r-function has

$$\int_0^1 f(x) \, dx = 0.$$

Lemma 1 Suppose $f_1, \ldots, f_l$ are $r_1$-, ..., $r_l$-functions, respectively. Suppose
an odd number among $r_1, \ldots, r_l$ is equal to their maximum $r = \max(r_1, \ldots, r_l)$.
Then the product $f_1(x) \cdots f_l(x)$ is an r-function, and

$$\int_0^1 f_1(x) \cdots f_l(x) \, dx = 0.$$

Proof It suffices to observe that the product of an odd number of
r-functions is an r-function, and the product of an r-function and an
s-function with s < r is an r-function.
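
Lemma 1 is easy to test numerically, because every function involved is constant on dyadic cells. The sketch below builds random r-functions (random signs on the r-intervals) and integrates their product exactly as an integer sum; the helper names are hypothetical and the random choices are only for illustration.

```python
import random

def random_r_function(r, rng):
    """Pick a sign on each of the 2^r r-intervals; the function is sign * R_r there."""
    signs = [rng.choice((-1, 1)) for _ in range(2 ** r)]
    def f(x):
        interval = int(x * 2 ** r)          # index m of the r-interval containing x
        half = int(x * 2 ** (r + 1)) % 2    # digit beta_{r+1}(x)
        return signs[interval] * (-1 if half else 1)
    return f

def integral_of_product(funcs, depth):
    """Sum of the product over the midpoints of the 2^depth cells of length 2^-depth.
    Dividing by 2^depth gives the integral when all factors are constant on the cells."""
    n = 2 ** depth
    total = 0
    for j in range(n):
        x = (j + 0.5) / n
        value = 1
        for f in funcs:
            value *= f(x)
        total += value
    return total

rng = random.Random(0)
ranks = [1, 3, 2, 3, 3]                     # the maximum 3 occurs an odd number of times
funcs = [random_r_function(r, rng) for r in ranks]
print(integral_of_product(funcs, depth=4))  # Lemma 1 predicts 0
```

An r-function is constant on cells of length $2^{-(r+1)}$, so `depth` should be at least one more than the largest rank.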

Given a k-tuple $\mathbf{r} = (r_1, \ldots, r_k)$ of nonnegative integers, put

$$|\mathbf{r}| = r_1 + \cdots + r_k. \tag{3.1}$$

For $\mathbf{x}$ in $U^k$, write

$$R_\mathbf{r}(\mathbf{x}) = R_{r_1}(x_1) \cdots R_{r_k}(x_k).$$

An $\mathbf{r}$-box is defined as a box $B = I_1 \times \cdots \times I_k$, where $I_j$ is an $r_j$-interval for
j = 1, ..., k. An $\mathbf{r}$-function is a function $f(\mathbf{x})$ on $U^k$ such that in every $\mathbf{r}$-box
either $f(\mathbf{x}) = R_\mathbf{r}(\mathbf{x})$ or $f(\mathbf{x}) = -R_\mathbf{r}(\mathbf{x})$. There are $2^{2^{|\mathbf{r}|}}$ such $\mathbf{r}$-functions. For
given $x_1, \ldots, x_{j-1}, x_{j+1}, \ldots, x_k$, an $\mathbf{r}$-function is an $r_j$-function in the variable
$x_j$. Combining this remark with Lemma 1, we obtain

Lemma 2 Suppose $f_1, \ldots, f_l$ are, respectively, $\mathbf{r}_1$-, ..., $\mathbf{r}_l$-functions. Write
$\mathbf{r}_i = (r_{i1}, \ldots, r_{ik})$ and $r = \max(r_{1k}, \ldots, r_{lk})$. Suppose an odd number among $r_{1k}$,
..., $r_{lk}$ equals r. Then

$$\int f_1(\mathbf{x}) \cdots f_l(\mathbf{x}) \, d\mathbf{x} = 0.$$

In the lemma the kth coordinate is singled out. The same result is of
course true for the jth coordinate with $1 \le j \le k$.

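Lemma 3 Suppose k = 2, $\nu \ge 0$. Suppose for each $\mathbf{r}$ with $|\mathbf{r}| = \nu$ we have
picked an $\mathbf{r}$-function $f_\mathbf{r}$. For $\mathbf{r}_1, \ldots, \mathbf{r}_{2m}$ with $|\mathbf{r}_1| = \cdots = |\mathbf{r}_{2m}| = \nu$, put

$$\mathfrak{I}(\mathbf{r}_1, \ldots, \mathbf{r}_{2m}) = \int f_{\mathbf{r}_1}(\mathbf{x}) \cdots f_{\mathbf{r}_{2m}}(\mathbf{x}) \, d\mathbf{x}. \tag{3.2}$$

Then $\mathfrak{I}(\mathbf{r}_1, \ldots, \mathbf{r}_{2m}) = 0$, unless there is a decomposition of {1, ..., 2m} into m
pairs $(i_1, j_1), \ldots, (i_m, j_m)$ with $\mathbf{r}_{i_t} = \mathbf{r}_{j_t}$ for t = 1, ..., m.

Proof For m = 1 we know from Lemma 2 that $\mathfrak{I}(\mathbf{r}_1, \mathbf{r}_2) = 0$ unless
$r_{12} = r_{22}$, i.e., unless $\mathbf{r}_1 = \mathbf{r}_2$ in view of $|\mathbf{r}_1| = |\mathbf{r}_2|$. For m > 1, we know
again that the integral is 0 unless $\mathbf{r}_{i_1} = \mathbf{r}_{j_1}$ for some $i_1 \ne j_1$. Since the square
of an $\mathbf{r}$-function is 1, it follows that in this case

$$\mathfrak{I}(\mathbf{r}_1, \ldots, \mathbf{r}_{2m}) = \mathfrak{I}(\mathbf{r}_1, \ldots, \dot{\mathbf{r}}_{i_1}, \ldots, \dot{\mathbf{r}}_{j_1}, \ldots, \mathbf{r}_{2m}),$$

where the dots mean that $\mathbf{r}_{i_1}$, $\mathbf{r}_{j_1}$ have been omitted. The proof is now
completed by induction.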
4. Estimation of Moments

Lemma 4 Suppose $k \ge 2$, $m \ge 1$ and $\nu \ge 0$. Suppose that for every $\mathbf{r}$ with
$|\mathbf{r}| = \nu$ we are given an $\mathbf{r}$-function $f_\mathbf{r}$. Put

$$F(\mathbf{x}) = \sum_{|\mathbf{r}| = \nu} f_\mathbf{r}(\mathbf{x}). \tag{4.1}$$

Then (2.5) holds.

Proof Let $M = M_{km}(\nu)$ be the maximum of $\int F^{2m}(\mathbf{x}) \, d\mathbf{x}$, taken over all
functions $F(\mathbf{x})$ of the type (4.1), where each $f_\mathbf{r}(\mathbf{x})$ is either an $\mathbf{r}$-function or is
identically zero. We shall prove that

$$M_{km}(\nu) \le (2m)^{m(2k-3)}(\nu + 1)^{m(k-1)}. \tag{4.2}$$

Let the $f_\mathbf{r}$ be chosen such that M is attained for $F(\mathbf{x})$ given by (4.1). We then
have

$$M = \int F(\mathbf{x})^{2m} \, d\mathbf{x} = \sum_{\mathbf{r}_1} \cdots \sum_{\mathbf{r}_{2m}} \mathfrak{I}(\mathbf{r}_1, \ldots, \mathbf{r}_{2m}), \tag{4.3}$$

with $\mathfrak{I}(\mathbf{r}_1, \ldots, \mathbf{r}_{2m})$ given by (3.2).

Suppose at first that k = 2. Then by Lemma 3 we may restrict ourselves
to summands having $\mathbf{r}_{i_t} = \mathbf{r}_{j_t}$ (t = 1, ..., m) for a certain division of {1, ...,
2m} into pairs $(i_1, j_1), \ldots, (i_m, j_m)$. If this is the case, the integral $\mathfrak{I}(\mathbf{r}_1, \ldots, \mathbf{r}_{2m})$
is 0 if some of the functions $f_{\mathbf{r}_1}, \ldots, f_{\mathbf{r}_{2m}}$ are identically zero, and it is 1
otherwise. The number of divisions into pairs is $(2m - 1)(2m - 3) \cdots 3 \cdot 1 \le
(2m)^m$, and the number of possibilities for $\mathbf{r}_{i_1}, \ldots, \mathbf{r}_{i_m}$ is $(\nu + 1)^m$. Thus for
k = 2 we obtain the desired estimate

$$\int F(\mathbf{x})^{2m} \, d\mathbf{x} \le (2m)^m (\nu + 1)^m.$$
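
The k = 2 estimate can be checked exactly for one particular F. The sketch below takes each $f_\mathbf{r}$ to be the product $R_{r_1}(x_1) R_{r_2}(x_2)$ itself, which is only one of the admissible choices, and the function names are hypothetical; since F is then constant on a dyadic grid, the moment is a finite average.

```python
from math import floor

def rademacher(r, x):
    """R_r(x) = (-1)^beta_{r+1}(x) for x in [0, 1)."""
    return -1 if floor(x * 2 ** (r + 1)) % 2 else 1

def moment_k2(nu, m):
    """Integral of F^(2m) over the unit square, where
    F(x1, x2) = sum over r1 + r2 = nu of R_r1(x1) * R_r2(x2)."""
    n = 2 ** (nu + 1)                       # F is constant on an n-by-n grid of cells
    mids = [(j + 0.5) / n for j in range(n)]
    total = 0
    for x1 in mids:
        for x2 in mids:
            f = sum(rademacher(r1, x1) * rademacher(nu - r1, x2)
                    for r1 in range(nu + 1))
            total += f ** (2 * m)
    return total / n ** 2

def pairings(m):
    """(2m-1)(2m-3)...3*1, the number of ways to split {1,...,2m} into pairs."""
    out = 1
    for j in range(1, 2 * m, 2):
        out *= j
    return out

for m in (1, 2, 3):
    print(m, moment_k2(3, m), (2 * m) ** m * (3 + 1) ** m)
```

For m = 1 the moment equals $\nu + 1$, the number of summands, reflecting the orthogonality given by Lemma 2.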

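Now let k > 2. For each summand on the right-hand side of (4.3) put
$r = \max(r_{1k}, \ldots, r_{2m,k})$, and write h for the number of components $r_{1k}$, ...,
$r_{2m,k}$ which are equal to r. Then $\mathfrak{I}(\mathbf{r}_1, \ldots, \mathbf{r}_{2m}) = 0$ unless h is even, i.e., unless
h = 2u with $1 \le u \le m$. There are $\binom{2m}{2u}$ ways to pick 2u elements out of {1, ...,
2m}, and $\mathfrak{I}(\mathbf{r}_1, \ldots, \mathbf{r}_{2m})$ is invariant under permutations of $\mathbf{r}_1, \ldots, \mathbf{r}_{2m}$. Thus if
the numbers u, v are always connected by u + v = m, then

$$M = \sum_{u=1}^{m} \binom{2m}{2u} \sum_{r=0}^{\nu} {\sum}_1 {\sum}_2 \, \mathfrak{I}(\mathbf{r}_1, \ldots, \mathbf{r}_{2u}, \mathbf{s}_1, \ldots, \mathbf{s}_{2v}),$$

where $\sum_1$ is the sum over 2u-tuples $\mathbf{r}_1, \ldots, \mathbf{r}_{2u}$ with $|\mathbf{r}_1| = \cdots = |\mathbf{r}_{2u}| = \nu$
and $r_{1k} = \cdots = r_{2u,k} = r$, and $\sum_2$ is the sum over 2v-tuples $\mathbf{s}_1, \ldots, \mathbf{s}_{2v}$ with
$|\mathbf{s}_1| = \cdots = |\mathbf{s}_{2v}| = \nu$ and $s_{1k}, \ldots, s_{2v,k} < r$. Thus if

$$A_r(\mathbf{x}) = \sum f_\mathbf{r}(\mathbf{x}),$$

where the sum is over $\mathbf{r}$ with $|\mathbf{r}| = \nu$ and $r_k = r$, and if

$$B_r(\mathbf{x}) = \sum f_\mathbf{r}(\mathbf{x}),$$

where the sum is over $\mathbf{r}$ with $|\mathbf{r}| = \nu$ and $r_k < r$, then

$$M = \sum_{u=1}^{m} \binom{2m}{2u} \sum_{r=0}^{\nu} \int A_r^{2u}(\mathbf{x}) B_r^{2v}(\mathbf{x}) \, d\mathbf{x}. \tag{4.4}$$

By Hölder's inequality,

$$\int A_r^{2u}(\mathbf{x}) B_r^{2v}(\mathbf{x}) \, d\mathbf{x} \le \left(\int A_r^{2m}(\mathbf{x}) \, d\mathbf{x}\right)^{u/m} \left(\int B_r^{2m}(\mathbf{x}) \, d\mathbf{x}\right)^{v/m}. \tag{4.5}$$

The sum defining $A_r(\mathbf{x})$ is over $\mathbf{r} = (\bar{\mathbf{r}}, r)$ where $\bar{\mathbf{r}} = (r_1, \ldots, r_{k-1})$ runs
through (k - 1)-tuples with $|\bar{\mathbf{r}}| = \nu - r$. Hence for given $x_k$, the function
$A_r(\mathbf{x})$ in $x_1, \ldots, x_{k-1}$ is of the type of $F(\mathbf{x})$, but with k - 1 in place of k and
$\nu - r$ in place of $\nu$. Thus if we proceed by induction on k, we may conclude
that

$$\int A_r^{2m}(\mathbf{x}) \, d\mathbf{x} \le (2m)^{m(2k-5)}(\nu + 1)^{m(k-2)}. \tag{4.6}$$

The function $B_r(\mathbf{x})$ is like $F(\mathbf{x})$, except that certain $f_\mathbf{r}$ in the sum (4.1) are
replaced by 0. Thus by our definition of $M = M_{km}(\nu)$,

$$\int B_r^{2m}(\mathbf{x}) \, d\mathbf{x} \le M. \tag{4.7}$$

Combining (4.4)-(4.7) we obtain

$$M \le (\nu + 1) \sum_{u=1}^{m} \binom{2m}{2u} \bigl((2m)^{2k-5}(\nu + 1)^{k-2}\bigr)^u M^{v/m}.$$

Dividing by $M(\nu + 1)$ and putting $M = L^m$ we get

$$(\nu + 1)^{-1} \le \sum_{u=1}^{m} \binom{2m}{2u} \bigl((2m)^{2k-5}(\nu + 1)^{k-2} L^{-1}\bigr)^u.$$

So with $K = (2m)^{2k-5}(\nu + 1)^{k-2} L^{-1}$ we have

$$(\nu + 1)^{-1} \le \sum_{u=1}^{m} \binom{2m}{2u} K^u \le \sum_{u=1}^{m} \frac{(4m^2 K)^u}{(2u)!}. \tag{4.8}$$

Since

$$\sum_{u=1}^{\infty} \frac{1}{(2u)!} < 1,$$

we obtain $4m^2 K \ge (\nu + 1)^{-1}$, or

$$L \le (2m)^{2k-3}(\nu + 1)^{k-1},$$

or (4.2).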
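5. Construction of an Auxiliary Function†

Lemma 5 Suppose $|\mathbf{r}| = \nu$ with

$$2^\nu \ge 2N. \tag{5.1}$$

Then there is an $\mathbf{r}$-function $f_\mathbf{r}$ with

$$\int f_\mathbf{r}(\mathbf{x}) D(\mathbf{x}) \, d\mathbf{x} \ge N \cdot 2^{-\nu-2k-1}. \tag{5.2}$$

Proof The integral in (5.2) may be decomposed into integrals over
$\mathbf{r}$-boxes. We choose $f_\mathbf{r}(\mathbf{x})$ such that the integral over every $\mathbf{r}$-box is
nonnegative.

If B is the $\mathbf{r}$-box given by $m_i 2^{-r_i} \le x_i < (m_i + 1) 2^{-r_i}$ (i = 1, ..., k), let B'
be the smaller box $m_i 2^{-r_i} \le x_i < (m_i + \tfrac{1}{2}) 2^{-r_i}$ (i = 1, ..., k). Then

$$\int_B R_\mathbf{r}(\mathbf{x}) D(\mathbf{x}) \, d\mathbf{x} = \int_{B'} \sum_{\varepsilon_1=0}^{1} \cdots \sum_{\varepsilon_k=0}^{1} (-1)^{\varepsilon_1 + \cdots + \varepsilon_k}
D(y_1 + \varepsilon_1 2^{-r_1-1}, \ldots, y_k + \varepsilon_k 2^{-r_k-1}) \, d\mathbf{y}. \tag{5.3}$$

Now

$$\sum_{\varepsilon_1=0}^{1} \cdots \sum_{\varepsilon_k=0}^{1} (-1)^{\varepsilon_1 + \cdots + \varepsilon_k} Z(y_1 + \varepsilon_1 2^{-r_1-1}, \ldots, y_k + \varepsilon_k 2^{-r_k-1}) \tag{5.4}$$

is, apart from the sign $(-1)^k$, the number among the given N points that lie in the box
$y_i \le x_i < y_i + 2^{-r_i-1}$ (i = 1, ..., k), which is contained in B. Thus this sum is
0 if B contains none of the given N points. On the other hand,

$$\sum_{\varepsilon_1=0}^{1} \cdots \sum_{\varepsilon_k=0}^{1} (-1)^{\varepsilon_1 + \cdots + \varepsilon_k} (y_1 + \varepsilon_1 2^{-r_1-1}) \cdots (y_k + \varepsilon_k 2^{-r_k-1})
= (-1)^k 2^{-|\mathbf{r}|-k} = (-1)^k 2^{-\nu-k}. \tag{5.5}$$

If B contains none of the given N points, then

$$\int_B R_\mathbf{r}(\mathbf{x}) D(\mathbf{x}) \, d\mathbf{x} = (-1)^{k+1} N 2^{-\nu-k} 2^{-\nu-k}$$

since B' has volume $2^{-\nu-k}$. We observe that in view of (5.1), at least half of
the $2^\nu$ $\mathbf{r}$-boxes B contain none of the given N points. Thus we obtain (5.2) for
a suitably chosen $\mathbf{r}$-function $f_\mathbf{r}$.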
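
The identity (5.5) factors coordinate by coordinate, since $y_i - (y_i + 2^{-r_i-1}) = -2^{-r_i-1}$. A minimal exact check with rational arithmetic follows; the function name and the sample values of $\mathbf{y}$ and $\mathbf{r}$ are arbitrary illustrations.

```python
from fractions import Fraction
from itertools import product

def alternating_volume_sum(y, r):
    """Left-hand side of (5.5): sum over eps in {0,1}^k of
    (-1)^(eps_1+...+eps_k) * prod_i (y_i + eps_i * 2^(-r_i-1))."""
    total = Fraction(0)
    for eps in product((0, 1), repeat=len(y)):
        term = Fraction(1)
        for y_i, r_i, e_i in zip(y, r, eps):
            term *= y_i + e_i * Fraction(1, 2 ** (r_i + 1))
        total += (-1) ** sum(eps) * term
    return total

y = (Fraction(1, 3), Fraction(2, 7), Fraction(5, 11))
r = (2, 0, 1)                                  # k = 3, |r| = 3
print(alternating_volume_sum(y, r))            # (-1)^3 * 2^(-3-3)
```

The result does not depend on $\mathbf{y}$, which is why the integral over B' in the proof reduces to a multiple of its volume.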

Given $\nu$ with (5.1), we construct an $\mathbf{r}$-function $f_\mathbf{r}$ according to Lemma 5
for every $\mathbf{r}$ with $|\mathbf{r}| = \nu$, and we define $F(\mathbf{x})$ by (4.1). Then since the number
of k-tuples $\mathbf{r}$ with $|\mathbf{r}| = \nu$ is $\ge c_{12}(k)(\nu + 1)^{k-1}$, we obtain

$$\int F(\mathbf{x}) D(\mathbf{x}) \, d\mathbf{x} \ge c_{13}(k)(\nu + 1)^{k-1} 2^{-\nu} N. \tag{5.6}$$

† As noted in Section 2, our auxiliary function is almost the same as the one of Roth [4].


6. Proof of Theorem 1 

Let v be the number with (2.6). Then (5.6) yields (2.1), and (2.5) is true by 
Lemma 4. Theorem 1 follows. 

7. Proof of Theorem 2 

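Write $\exp x = e^x$ and

$$E_0(\mathbf{x}) = D(\mathbf{x})(\nu + 1)^{-(k-1)/2}, \qquad G(\mathbf{x}) = F(\mathbf{x})(\nu + 1)^{-(k-1)/2}, \tag{7.1}$$

where $F(\mathbf{x})$ is the auxiliary function constructed in Section 5.

Lemma 6 For $q > k - \tfrac{3}{2}$,

$$\int |G(\mathbf{x})| \exp(|G(\mathbf{x})|^{1/q}) \, d\mathbf{x} \le c_{14}(k, q).$$

Proof Since $\|G\|_a$ is nondecreasing in a, we have $\|G\|_a \le \|G\|_{2m}$ for
$a \le 2m$, whence by (2.5),

$$\int |G(\mathbf{x})|^a \, d\mathbf{x} \le \left(\int G(\mathbf{x})^{2m} \, d\mathbf{x}\right)^{a/2m} \le (2m)^{a(k-(3/2))}.$$

Therefore

$$\int |G(\mathbf{x})| \exp(|G(\mathbf{x})|^{1/q}) \, d\mathbf{x} = \sum_{h=0}^{\infty} \frac{1}{h!} \int |G(\mathbf{x})|^{1+(h/q)} \, d\mathbf{x}
\le \sum_{h=0}^{\infty} \frac{1}{h!} \langle 1 + (h/q) \rangle^{(1+(h/q))(k-(3/2))}, \tag{7.2}$$

where $\langle \beta \rangle$ denotes the next larger even integer to $\beta$. For large h, we have
$\langle 1 + (h/q) \rangle < 2h$, and an exponent

$$(1 + (h/q))(k - \tfrac{3}{2}) < (1 - 2\delta)h$$

with $\delta = (1 - ((k - \tfrac{3}{2})/q))/4$. Since $h! > (2h)^{(1-\delta)h}$ for large h, a typical
summand is $< (2h)^{-\delta h}$, so that the sum in (7.2) is convergent, say to $c_{14}(k, q)$.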
With $\nu$ determined by (2.6), we have

$$\int G(\mathbf{x}) E_0(\mathbf{x}) \, d\mathbf{x} \ge c_{15}(k) > 0 \tag{7.3}$$

by (5.6), (7.1). Pick $c_{16} = c_{16}(k, q)$ so large that $c_{14} e^{-c_{16}} < \tfrac{1}{2} c_{15}$. Let S be the
set of $\mathbf{x}$ in $U^k$ with

$$|E_0(\mathbf{x})| \le \exp(|G(\mathbf{x})|^{1/q} - c_{16}).$$

By Lemma 6 the integral of $G(\mathbf{x}) E_0(\mathbf{x})$ over S is $< \tfrac{1}{2} c_{15}$. Hence the integral
over the complement T of S is $> \tfrac{1}{2} c_{15}$, by (7.3). On T,

$$(\log |E_0(\mathbf{x})| + c_{16})^q > |G(\mathbf{x})|,$$

so that

$$\int_T |E_0(\mathbf{x})| \, \bigl|\log |E_0(\mathbf{x})| + c_{16}\bigr|^q \, d\mathbf{x} \ge \int_T |E_0(\mathbf{x}) G(\mathbf{x})| \, d\mathbf{x} > \tfrac{1}{2} c_{15}.$$

Choose $c_{17} > 0$ so small that $z |\log z + c_{16}|^q < \tfrac{1}{4} c_{15}$ for $0 < z < c_{17}$.
Then

$$\int_V |E_0(\mathbf{x})| \, \bigl|\log |E_0(\mathbf{x})| + c_{16}\bigr|^q \, d\mathbf{x} > \tfrac{1}{4} c_{15},$$

where V consists of $\mathbf{x} \in U^k$ with $|E_0(\mathbf{x})| \ge c_{17}$. We now choose
$c_2 = c_2(k, q)$ so large that

$$c_2' = c_2((\nu + 1)/\log N)^{(k-1)/2}$$

has

$$c_2' z \, |\log c_2' z|^q > 4 c_{15}^{-1} z \, |\log z + c_{16}|^q$$

for $z \ge c_{17}$. With this value of $c_2$,

$$c_2 E(\mathbf{x}) = c_2 D(\mathbf{x})(\log N)^{-(k-1)/2} = c_2' E_0(\mathbf{x})$$

satisfies (1.3).


8. Estimation of Sums 

We now turn to Theorem 3. For simplicity of notation, we shall assume
that k = 2. The constants $\gamma_1, \gamma_2, \ldots$ will be positive and absolute.

Suppose we are given a discrete set of points in the quadrant $y_1 \ge 0$,
$y_2 \ge 0$. For $\mathbf{x} = (x_1, x_2)$ in this quadrant, write $Z(\mathbf{x})$ for the number of
points among the given discrete set that lie in the rectangle $0 \le y_1 < x_1$,
$0 \le y_2 < x_2$. Put $D(\mathbf{x}) = Z(\mathbf{x}) - N x_1 x_2$. Now if Q is a square

$$m_i \le x_i < m_i + 1 \quad (i = 1, 2) \tag{8.1}$$

in our quadrant, the results proved above about $U^2$ remain true for Q. More
precisely, if the number N(Q) of points among the given discrete set in Q
satisfies

$$2^\nu \ge 2N(Q), \tag{8.2}$$

then there is a function $F(\mathbf{x})$ defined on Q such that

$$\int_Q F(\mathbf{x}) D(\mathbf{x}) \, d\mathbf{x} \ge \gamma_1 (\nu + 1) 2^{-\nu} N \tag{8.3}$$

(in analogy with (5.6)) and

$$\int_Q F^{2m}(\mathbf{x}) \, d\mathbf{x} \le (2m)^m (\nu + 1)^m \quad (m = 1, 2, \ldots) \tag{8.4}$$

(in analogy with (2.5)). For example, in order to derive (8.3), we note that the
alternating sums (5.4) depend only on the points of the given discrete set that
lie in Q.

Given a square Q determined by (8.1) and given a nonnegative integer $\mu$,
a grid $\Gamma(Q, \mu)$ will be a set of points $\mathbf{x} = (x_1, x_2)$ of the type

$$x_i = m_i + \xi_i + l_i 2^{-\mu} \quad (i = 1, 2),$$

where $\xi_1, \xi_2$ are fixed with $0 \le \xi_i < 2^{-\mu}$ (i = 1, 2), and $l_1, l_2$ run independently
through the integers 0, 1, ..., $2^\mu - 1$. The grid $\Gamma(Q, \mu)$ is contained in
Q and the number of its points is

$$|\Gamma(Q, \mu)| = 4^\mu. \tag{8.5}$$

Because of the freedom in choosing $\xi_1, \xi_2$, there are infinitely many grids
$\Gamma(Q, \mu)$ contained in Q.

Given integers $\mu > \nu \ge 0$, we may replace the integrals (8.3), (8.4) by
sums over the points of a grid $\Gamma(Q, \mu)$. This is so because in the alternating
sums (5.3)-(5.5) we never get outside the grid. We sum up our results in

Lemma 7 Let Q be a square of the type (8.1). Let $\mu$, $\nu$ be integers with

$$\mu > \nu \ge 0 \quad\text{and}\quad 2^\nu \ge 2N(Q). \tag{8.6}$$

Let a grid $\Gamma(Q, \mu)$ be given. Then there is a function $F(\mathbf{x})$ defined on $\Gamma(Q, \mu)$
with

$$|\Gamma(Q, \mu)|^{-1} \sum_{\mathbf{x} \in \Gamma(Q, \mu)} F(\mathbf{x}) D(\mathbf{x}) \ge \gamma_1 (\nu + 1) 2^{-\nu} N, \tag{8.7}$$

$$|\Gamma(Q, \mu)|^{-1} \sum_{\mathbf{x} \in \Gamma(Q, \mu)} F^{2m}(\mathbf{x}) \le (2m)^m (\nu + 1)^m \quad (m = 1, 2, \ldots). \tag{8.8}$$
We note that $|F(\mathbf{x})| \le \nu + 1$ since $F(\mathbf{x})$ was constructed as the sum of
$\nu + 1$ $\mathbf{r}$-functions.

More generally, we shall allow squares Q of the type

$$m_i 2^{-\lambda} \le x_i < (m_i + 1) 2^{-\lambda} \quad (i = 1, 2), \tag{8.9}$$

where $\lambda$ is a nonnegative integer. A grid $\Gamma(Q, \mu)$ will be a set of points
$\mathbf{x} = (x_1, x_2)$ of the form

$$x_i = 2^{-\lambda}(m_i + \xi_i + l_i 2^{-\mu}) \quad (i = 1, 2)$$

where $\xi_1, \xi_2$ are given with $0 \le \xi_i < 2^{-\mu}$ and where $l_1, l_2$ run through 0, 1,
..., $2^\mu - 1$. Such a grid is contained in Q and its number of elements is given
by (8.5).

Lemma 8 Let Q be a square of the type (8.9). Let $\mu$, $\nu$ be integers satisfying
(8.6). Let a grid $\Gamma(Q, \mu)$ be given. Then there is a function $F(\mathbf{x})$ defined on
$\Gamma(Q, \mu)$ with (8.8), with

$$|\Gamma(Q, \mu)|^{-1} \sum_{\mathbf{x} \in \Gamma(Q, \mu)} F(\mathbf{x}) D(\mathbf{x}) \ge \gamma_1 (\nu + 1) 2^{-\nu-2\lambda} N, \tag{8.10}$$

and with

$$|F(\mathbf{x})| \le \nu + 1. \tag{8.11}$$

Proof The dilation T with $T\mathbf{x} = 2^\lambda \mathbf{x}$ maps Q into a square Q' of side 1
and it maps $\Gamma(Q, \mu)$ into a grid $\Gamma(Q', \mu)$. If P is the given set of points, put
P' = T(P) and define $Z'(\mathbf{x})$ with reference to P'. Put $N' = 4^{-\lambda} N$ and $D'(\mathbf{x}) =
Z'(\mathbf{x}) - N' x_1 x_2$. Then

$$D'(T\mathbf{x}) = Z'(T\mathbf{x}) - N' 4^\lambda x_1 x_2 = Z(\mathbf{x}) - N x_1 x_2 = D(\mathbf{x}).$$

We apply Lemma 7 with $\Gamma(Q', \mu)$ in place of $\Gamma(Q, \mu)$, and with N', D' in place
of N, D. We obtain a function $F'(\mathbf{x})$ defined on $\Gamma(Q', \mu)$. The function
$F(\mathbf{x}) = F'(T\mathbf{x}) = F'(2^\lambda \mathbf{x})$ is defined on $\Gamma(Q, \mu)$ and has the desired
properties.

Lemma 9 Let $\lambda$, $\mu$, $\nu$, Q, $\Gamma(Q, \mu)$, $F(\mathbf{x})$ be as in Lemma 8. Suppose $X \ge 4$.
The number $\varphi(X)$ of points $\mathbf{x}$ of $\Gamma(Q, \mu)$ with

$$|F(\mathbf{x})| \ge X(\nu + 1)^{1/2}$$

satisfies

$$\varphi(X) \le |\Gamma(Q, \mu)| \exp(-2^{-4} X^2).$$

Proof We have

$$\varphi(X) |\Gamma(Q, \mu)|^{-1} X^{2m} \le (2m)^m \quad (m = 1, 2, \ldots)$$

by (8.8). Pick an integer m with

$$1 \le (1/4e) X^2 \le m \le (1/2e) X^2.$$

Then

$$\varphi(X) |\Gamma(Q, \mu)|^{-1} \le (2m X^{-2})^m \le e^{-m} \le \exp(-2^{-4} X^2).$$
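
The choice of m in this proof can be made explicit. In the sketch below, `choose_m` takes the smallest admissible integer, which is one possible choice among several, and `tail_bounds` returns the three quantities in the final chain so that their ordering can be inspected; both names are hypothetical.

```python
from math import ceil, e, exp

def choose_m(X):
    """Smallest integer m with X^2/(4e) <= m; for X >= 4 it also satisfies m <= X^2/(2e)."""
    return ceil(X * X / (4 * e))

def tail_bounds(X):
    """Return (2m/X^2)^m, e^(-m), exp(-X^2/16) for the chosen m."""
    m = choose_m(X)
    return (2 * m / (X * X)) ** m, exp(-m), exp(-X * X / 16)

for X in (4, 5, 10):
    print(X, choose_m(X), tail_bounds(X))
```

The interval $[X^2/4e, X^2/2e]$ has length at least 1 as soon as $X \ge 4$, which is why an integer m of this kind exists.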


9. Preparations for Theorem 3 

The basic idea of the proof is as follows. For most squares $Q$ of the type (8.9), $N(Q)$ should be not much larger than $N4^{-\lambda}$, where $N$ is the number of our given points in $U^2$. For such a square $Q$, we may choose $\nu$ with $2^{\nu} \ge 2N(Q)$ such that $2^{\nu}$ has the order of magnitude $N4^{-\lambda}$, and hence $2^{\nu + 2\lambda}$ is about equal to $N$. Thus (8.10) becomes

$$|\Gamma(Q, \mu)|^{-1} \sum_{x \in \Gamma(Q, \mu)} F(x)D(x) > \gamma_2(\nu + 1), \tag{9.1}$$

for most squares $Q$. The auxiliary function $F(x)$ has $|F(x)| \le \nu + 1$, but from Lemma 9 it is clear that it is usually not much larger than $(\nu + 1)^{1/2}$. If we knew that the summands in (9.1) where $|F(x)|$ is small compared to $\nu + 1$ give a large contribution, we could conclude that

$$|\Gamma(Q, \mu)|^{-1} \sum_{x \in \Gamma(Q, \mu)} |D(x)|$$

is large. If we knew this to happen for many grids, we would ultimately be led to a good lower bound for $\|D\|_1$. It will therefore be necessary to pay attention to the case when the relatively few summands of (9.1) where $|F(x)|$ is almost as large as $\nu + 1$ give a large contribution. In this case there is a small subset $A$ of $\Gamma(Q, \mu)$ such that

$$|\Gamma(Q, \mu)|^{-1} \sum_{x \in A} |D(x)| > \gamma_3. \tag{9.2}$$

In order to go beyond such a constant to a function that tends to infinity with $N$, we shall have to apply Lemma 8 with different values of $\lambda$, $\mu$, and $\nu$. We now proceed to carry out this program. Let $p$ be the integer with

$$4N \le 4^p < 16N. \tag{9.3}$$

Putting

$$\nu(u) = 12^u (u!)^2 \quad (u = 1, 2, \ldots), \tag{9.4}$$

we have

$$u\,4^{\nu(u-1)+1} \exp(-2^{-2}u^{-2}\nu(u)) < 4u \exp(2\nu(u-1) - 3\nu(u-1)) < 1 \tag{9.5}$$

for $u > 1$. Let $t$ be an integer with

$$\nu(t) \le 2p, \tag{9.6}$$

and write

$$\lambda(u) = p - \tfrac{1}{2}\nu(u), \quad \mu(u) = \nu(u) + 1 \quad (u = 1, \ldots, t). \tag{9.7}$$

Then $\lambda(u) + \mu(u) = p + \tfrac{1}{2}\nu(u) + 1$ is increasing in $1 \le u \le t$, and

$$2^{-\nu(u) - 2\lambda(u)}N = 4^{-p}N > 2^{-4}. \tag{9.8}$$
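The parameters of (9.3)-(9.7) are easy to tabulate. The sketch below is illustrative only: the function names are hypothetical, $\nu(u)$ is read as $12^u (u!)^2$ (so that $\nu(u) = 12u^2\nu(u-1)$, which is what the middle step of (9.5) uses), and the inequality (9.5) is checked in logarithmic form to avoid overflow.

```python
import math

def nu(u):
    """nu(u) = 12^u * (u!)^2, so that nu(u) = 12 * u^2 * nu(u - 1)."""
    return 12**u * math.factorial(u) ** 2

def log_lhs_9_5(u):
    """Natural log of u * 4^(nu(u-1)+1) * exp(-nu(u) / (4 u^2))."""
    # nu(u) / (4 u^2) equals 3 * nu(u - 1) exactly, so integer division is exact.
    return math.log(u) + (nu(u - 1) + 1) * math.log(4) - nu(u) // (4 * u * u)

def choose_p(N):
    """Return the integer p with 4N <= 4^p < 16N, as in (9.3)."""
    p = 0
    while 4**p < 4 * N:
        p += 1
    return p

def lam_mu(u, p):
    """Return (lambda(u), mu(u)) from (9.7); nu(u) is even, so lambda is an integer."""
    return p - nu(u) // 2, nu(u) + 1
```

For instance, with $N = 10^8$ one gets $p = 15$, and since $\nu(2) = 576 > 2p$ only $t = 1$ satisfies (9.6); this illustrates how slowly the admissible $t$ grows with $N$.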

A square $Q$ will be written as $Q^u$ if it is given by inequalities

$$m_i 2^{-\lambda(u)} \le x_i < (m_i + 1)2^{-\lambda(u)} \quad (i = 1, 2)$$

with integers $m_i$ in $0 \le m_i < 2^{\lambda(u)}$. For $1 \le v \le u \le t$, the number of squares $Q^v$ contained in a given square $Q^u$ is

$$A(v, u) = 4^{\lambda(v) - \lambda(u)}.$$

For $v \le u$ and a given square $Q^u$, write $A(v, Q^u)$ for the number of squares $Q^v \subset Q^u$ with

$$N(Q^v) \le 2N4^{-\lambda(v)}. \tag{9.9}$$

Clearly,

$$0 \le A(v, Q^u) \le A(v, u). \tag{9.10}$$

The right-hand side of (9.9) is $2N$ times the area of $Q^v$, while the average of $N(Q^v)$ over all squares $Q^v$ is $N$ times the area of $Q^v$. It follows that (9.9) must be true for at least half of the squares $Q^v$, i.e., for at least $\tfrac{1}{2}4^{\lambda(v)}$ squares $Q^v$. So for $v \le u$,

$$\sum_{Q^u} A(v, Q^u) \ge \tfrac{1}{2}4^{\lambda(v)}, \tag{9.11}$$

where the sum is over all squares $Q^u$.

For $1 \le u \le t$ and every square $Q^u$, put

$$h(Q^u) = \sum_{v=1}^{u} A(v, Q^u)/A(v, u), \tag{9.12}$$

so that by (9.10),

$$0 \le h(Q^u) \le u. \tag{9.13}$$

We shall write $\Gamma^u$ for a grid $\Gamma(Q^u, \mu(u))$. The number of points of $\Gamma^u$ is

$$B(u) = |\Gamma^u| = 4^{\mu(u)}. \tag{9.14}$$

Put

$$B(v, u) = B(u)B(v)^{-1} = 4^{\mu(u) - \mu(v)}.$$

For $1 \le v \le u \le t$, the square $Q^u$ is the disjoint union of $A(v, u)$ squares $Q^v$. The grid $\Gamma^u = \Gamma(Q^u, \mu(u))$ has "mesh" $2^{-\lambda(u) - \mu(u)}$ (i.e., its points have horizontal and vertical distances of this size), and hence $\Gamma^u$ is the union of $A(v, u)$ grids of the type $\Gamma(Q^v, \mu(u) + \lambda(u) - \lambda(v))$, which also have this mesh. In turn, since $\mu(u) + \lambda(u) - \lambda(v) \ge \mu(v)$, each grid $\Gamma(Q^v, \mu(u) + \lambda(u) - \lambda(v))$ is the disjoint union of $4^{\lambda(u) + \mu(u) - \lambda(v) - \mu(v)}$ grids $\Gamma(Q^v, \mu(v))$. We write all this up in

$$\Gamma^u = \bigcup_{Q^v \subset Q^u} \ \bigcup{}' \ \Gamma^v. \tag{9.15}$$

Here $\Gamma^u = \Gamma(Q^u, \mu(u))$, the outside union is over the $A(v, u)$ squares $Q^v \subset Q^u$, and the inside union $\bigcup'$ is over

$$4^{\lambda(u) + \mu(u) - \lambda(v) - \mu(v)} = B(v, u)A(v, u)^{-1}$$

grids $\Gamma^v = \Gamma(Q^v, \mu(v))$.

Given a grid $\Gamma^u$, write

$$g(\Gamma^u) = B(u)^{-1} \sum_{x \in \Gamma^u} |D(x)|. \tag{9.16}$$


10. A Lemma 

Lemma 10 For $1 \le u \le t$ and a grid $\Gamma^u = \Gamma(Q^u, \mu(u))$,

$$g(\Gamma^u) \ge \gamma_4 h(Q^u). \tag{10.1}$$

Proof We take

$$\gamma_4 = \gamma_1 2^{-6}, \tag{10.2}$$

where $\gamma_1$ is the constant of Lemma 8.

We begin with $u = 1$, in which case

$$h(Q^1) = A(1, Q^1),$$

which is 0 or 1. The interesting case is when it is 1; then $N(Q^1) \le 2N4^{-\lambda(1)}$, or by (9.3), (9.7),

$$2N(Q^1) \le 4N4^{-\lambda(1)} = N4^{1-p}2^{\nu(1)} \le 2^{\nu(1)}.$$

Thus (8.6) is true for $\mu = \mu(1)$, $\nu = \nu(1)$, $\Gamma(Q^1, \mu(1))$, and we may apply Lemma 8. Since $|F(x)| \le \nu + 1$, (8.10) yields

$$g(\Gamma^1) = |\Gamma^1|^{-1} \sum_{x \in \Gamma^1} |D(x)| > \gamma_1 2^{-\nu(1) - 2\lambda(1)}N > \gamma_1 2^{-4} > \gamma_4$$

by (9.8).
We now proceed to the induction from $u - 1$ to $u$, where $2 \le u \le t$. By (9.15) with $v = u - 1$,

$$g(\Gamma^u) = B(u-1, u)^{-1} \sum_{Q^{u-1} \subset Q^u} \ \sum_{\Gamma^{u-1}}{}' \ g(\Gamma^{u-1}), \tag{10.3}$$

where $\sum'$ is a sum over $B(u-1, u)A(u-1, u)^{-1}$ grids $\Gamma^{u-1} \subset Q^{u-1}$.

On the other hand, for $1 \le v \le u - 1$, we have $A(v, u) = A(v, u-1)A(u-1, u)$ and

$$A(v, Q^u)/A(v, u) = \sum_{Q^{u-1} \subset Q^u} A(v, Q^{u-1})/A(v, u) = A(u-1, u)^{-1} \sum_{Q^{u-1} \subset Q^u} A(v, Q^{u-1})/A(v, u-1).$$

It follows that

$$h(Q^u) = A(u, Q^u) + A(u-1, u)^{-1} \sum_{Q^{u-1} \subset Q^u} h(Q^{u-1}). \tag{10.4}$$

It will be convenient to write this as

$$h(Q^u) = A(u, Q^u) + B(u-1, u)^{-1} \sum_{Q^{u-1} \subset Q^u} \ \sum_{\Gamma^{u-1}}{}' \ h(Q^{u-1}), \tag{10.5}$$

where the inner sum $\sum'$ is over $B(u-1, u)A(u-1, u)^{-1}$ grids $\Gamma^{u-1} \subset Q^{u-1}$, as in (10.3), and the summand $h(Q^{u-1})$ is independent of $\Gamma^{u-1}$.

Our inductive assumption yields $g(\Gamma^{u-1}) \ge \gamma_4 h(Q^{u-1})$. Comparing (10.3) and (10.5), we thus obtain the desired relation (10.1), provided $A(u, Q^u) = 0$. It remains to deal with the case when

$$A(u, Q^u) = 1. \tag{10.6}$$

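Now the double sum in (10.3) and (10.5) is altogether over a set $\mathfrak{S}$ of $B(u-1, u)$ grids $\Gamma^{u-1}$. To deal with the case (10.6), it would suffice to construct a subset $\mathfrak{T}$ of $\mathfrak{S}$ such that (with the notation $h(\Gamma^{u-1}) = h(Q^{u-1})$ if $\Gamma^{u-1} = \Gamma(Q^{u-1}, \mu(u-1))$),

$$B(u-1, u)^{-1} \sum_{\Gamma^{u-1} \in \mathfrak{T}} \bigl(g(\Gamma^{u-1}) - \gamma_4 h(\Gamma^{u-1})\bigr) \ge \gamma_4.$$

Since $h(Q^{u-1}) \le u - 1$ by (9.13), it would suffice to have

$$B(u-1, u)^{-1} \sum_{\Gamma^{u-1} \in \mathfrak{T}} g(\Gamma^{u-1}) \ge \gamma_4 B(u-1, u)^{-1} u |\mathfrak{T}| + \gamma_4,$$

where $|\mathfrak{T}|$ is the number of elements of $\mathfrak{T}$; or

$$B(u)^{-1} \sum_{\Gamma^{u-1} \in \mathfrak{T}} \ \sum_{x \in \Gamma^{u-1}} |D(x)| \ge \gamma_4 B(u-1, u)^{-1} u |\mathfrak{T}| + \gamma_4.$$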
1. Introduction

Let $F$ be an algebraic number field, $\mu$ an integer in $F$, $x^m - \mu$ irreducible over $F$, and $\mathcal{P}$ a prime ideal in the ring of algebraic integers in $F$ with $(m, \mathcal{P}) = 1$. We are interested in discovering how the ideal $\mathcal{P}$ decomposes when considered as an ideal in $F(\mu^{1/m})$. It was shown, in Mann and Velez [2, p. 132], that we can reduce this problem to the situation where $(\mu, \mathcal{P}) = 1$. Under this assumption we have that $\mathcal{P}$ does not ramify in $F(\mu^{1/m})$. In Mann and Velez [2] we also defined ${}_F\Psi_m(\mathcal{P})$, the counting function for the decomposition of $\mathcal{P}$ in $F(\mu^{1/m})$, where ${}_F\Psi_m(\mathcal{P})$ merely counts how many ideals there are of each degree in the decomposition of $\mathcal{P}$ in $F(\mu^{1/m})$. That is, if

$${}_F\Psi_m(\mathcal{P}) = \sum_{i=1}^{n} g_i[f_i],$$

then $\mathcal{P}$ factors in $F(\mu^{1/m})$ into $g_i$ ideals of degree $f_i$, for $i = 1, \ldots, n$, and $m = \sum_{i=1}^{n} g_i f_i$.

† This work was supported in part by the U.S. Energy Research and Development Administration (ERDA), Contract No. AT(29-1)-789. The author is currently at Sandia Laboratories.
Let $Q^*$ denote the nonzero rational numbers and $Z$ the ring of integers and consider $Z(Q^*)$, the group ring of $Q^*$ with coefficients in $Z$. Let us denote elements in $Z(Q^*)$ by $\sum n_i[r_i]$, where $n_i \in Z$, $r_i \in Q^*$. Then $Z(Z^*)$ is imbedded in a natural way in $Z(Q^*)$. Moreover, ${}_F\Psi_m(\mathcal{P}) = \sum g_i[f_i]$ can be considered as being an element in $Z(Z^*)$ since $g_i \in Z$ and $f_i \in Z^*$. Hence, we can define an addition and multiplication for counting functions by considering the counting functions as elements of $Z(Z^*)$.
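The arithmetic of counting functions described above can be modelled directly. The following is a minimal sketch under stated assumptions: a counting function $\sum g_i[f_i]$ is stored as a dictionary mapping each degree $f_i$ to its coefficient $g_i$, the function names are hypothetical, and the two sample counting functions are illustrative values of the shape $1[1] + ((l-1)/r)[r]$ rather than data computed from a specific field.

```python
def cf_add(a, b):
    """Add two counting functions given as {degree: count} dictionaries."""
    out = dict(a)
    for degree, count in b.items():
        out[degree] = out.get(degree, 0) + count
    return {d: g for d, g in out.items() if g != 0}

def cf_mul(a, b):
    """Multiply in the group ring: [f1] * [f2] = [f1 * f2], extended bilinearly."""
    out = {}
    for d1, g1 in a.items():
        for d2, g2 in b.items():
            out[d1 * d2] = out.get(d1 * d2, 0) + g1 * g2
    return {d: g for d, g in out.items() if g != 0}

def total_degree(a):
    """Return sum g_i * f_i, which equals m for a counting function of x^m - mu."""
    return sum(d * g for d, g in a.items())

# Illustrative shapes: 1[1] + 2[3] (l = 7, r = 3) and 1[1] + 1[4] (l = 5, r = 4).
psi_7 = {1: 1, 3: 2}
psi_5 = {1: 1, 4: 1}
product = cf_mul(psi_7, psi_5)
```

Because $[r] \mapsto r$ extends to a ring homomorphism, the total degree is multiplicative; here the product has total degree $7 \cdot 5 = 35$, as it must if it is to be the counting function for $m = 35$.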

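In Mann and Velez [2] we were able to explicitly compute ${}_F\Psi_{l^c}(\mathcal{P})$, where $l$ is a prime, $(l\mu, \mathcal{P}) = 1$. Furthermore, we showed that if $m = \prod_{i=1}^{s} l_i^{c_i}$ and $l_i \mid N(\mathcal{P}) - 1$, where $N(\mathcal{P})$ denotes the norm of $\mathcal{P}$ over $Q$, then

$${}_F\Psi_m(\mathcal{P}) = \prod_{i=1}^{s} {}_F\Psi_{l_i^{c_i}}(\mathcal{P}).$$

In this paper we shall give a recursive procedure for computing ${}_F\Psi_m(\mathcal{P})$ in terms of the counting functions ${}_F\Psi_{l^c}(\mathcal{P})$, where $l$ is a prime. Furthermore, we shall investigate the phenomenon where

$${}_F\Psi_m(\mathcal{P}) = \prod_{i=1}^{s} {}_F\Psi_{l_i^{c_i}}(\mathcal{P}), \quad m = \prod_{i=1}^{s} l_i^{c_i}.$$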


2. Theorems and Proofs 

By $o(\mu)$ modulo $\mathcal{P}$ we shall mean the multiplicative order of $\mu$ in the group of units modulo $\mathcal{P}$.

Lemma 1 Let $x^m - \mu$ be irreducible over $F$, $l$ a prime, $(l, m) = 1$. If $l^n \,\|\, o(\mu)$ modulo $\mathcal{P}$ (where $l^n \,\|\, b$ means $l^n \mid b$, $l^{n+1} \nmid b$), then $l^n \,\|\, o(\mu^{1/m})$ modulo $\mathcal{P}_i$, where $\mathcal{P}_i$ is any prime divisor of $\mathcal{P}$ in $F(\mu^{1/m})$.

Proof The proof follows by applying Theorem 2 in Mann and Velez [2] and induction. ∎

Let $l \mid N(\mathcal{P}) - 1$. Then from Rules 1-4 in Mann and Velez [2], we see that ${}_F\Psi_{l^c}(\mathcal{P})$ depends only on three parameters, namely, $l^c$, $l^k$, where $l^k \,\|\, N(\mathcal{P}) - 1$, and $l^n = (o(\mu), l^k)$. Set $N(\mathcal{P}) = p^f$.

If $l \nmid N(\mathcal{P}) - 1$, then ${}_F\Psi_{l^c}(\mathcal{P})$ depends only on three parameters, namely, $r$, $l^c$ and $l^w$, where $l^w \,\|\, p^{fr} - 1$ and $r$ is the order of $p^f$ modulo $l$. To emphasize this, we set ${}_F\Psi_{l^c}(\mathcal{P}) = C(l^c, o(\mu), p^f)$, where $C$ is computed according to Rules 1-5 in Mann and Velez [2].

From the above comments we have the following: If $(q, l) = 1$, then

$$C(l^c, q \cdot o(\mu), p^f) = C(l^c, o(\mu), p^f). \tag{1}$$

If $(q, l) = 1$ and $l \mid p^f - 1$, then

$$C(l^c, o(\mu), p^{fq}) = C(l^c, o(\mu), p^f) \tag{2}$$

since if $l^k \,\|\, p^f - 1$, $k > 0$, then $l^k \,\|\, p^{fq} - 1$.

If $l \nmid (p^f - 1)$, $r$ is the order of $p^f$ modulo $l$ and $(q, lr) = 1$, then

$$C(l^c, o(\mu), p^{fq}) = C(l^c, o(\mu), p^f). \tag{3}$$

On using Theorem 4 from Mann and Velez [2], we can also show, by induction, that ${}_F\Psi_m(\mathcal{P})$ depends only on $m$, $N(\mathcal{P}) = p^f$, and the order of $\mu$ modulo $\mathcal{P}$. Hence, we set ${}_F\Psi_m(\mathcal{P}) = C(m, o(\mu), p^f)$.

If $(q, m) = 1$, then we have

$$C(m, q \cdot o(\mu), p^f) = C(m, o(\mu), p^f). \tag{4}$$

Let $m = q_1^{e_1} \cdots q_s^{e_s} l_1^{c_1} \cdots l_n^{c_n}$, where the $q_i$, $l_j$ are distinct primes, $q_i \mid p^f - 1$, $l_j \nmid p^f - 1$ and $r_j$ is the order of $p^f$ modulo $l_j$. If $(q, q_i) = (q, r_j l_j) = 1$, for all $i$, $j$, then

$$C(m, o(\mu), p^f) = C(m, o(\mu), p^{fq}). \tag{5}$$

Given $x^m - \mu$ and $N(\mathcal{P}) = p^f$, write

$$m = q_1^{e_1} \cdots q_s^{e_s} l_1^{c_1} \cdots l_n^{c_n}, \text{ where } q_i, l_j \text{ are distinct primes, } q_i \mid p^f - 1,\ l_j \nmid p^f - 1,\ r_j \text{ is the order of } p^f \text{ modulo } l_j, \text{ and } q_i^{k_i} \,\|\, p^f - 1. \tag{6}$$
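The bookkeeping in (6) is purely arithmetic and can be sketched in a few lines. This is an illustration only, assuming $m$, $p$, $f$ are given as ordinary positive integers with $p \nmid m$; the function names are hypothetical and trial division is used for simplicity.

```python
def factorize(m):
    """Prime factorization of m by trial division, as {prime: exponent}."""
    factors, d = {}, 2
    while d * d <= m:
        while m % d == 0:
            factors[d] = factors.get(d, 0) + 1
            m //= d
        d += 1
    if m > 1:
        factors[m] = factors.get(m, 0) + 1
    return factors

def valuation(n, prime):
    """Largest k with prime^k dividing the positive integer n."""
    k = 0
    while n % prime == 0:
        n //= prime
        k += 1
    return k

def order_mod(a, prime):
    """Multiplicative order of a modulo prime (a must be prime to it)."""
    a %= prime
    if a == 0:
        raise ValueError("a is divisible by the modulus")
    r, x = 1, a
    while x != 1:
        x = x * a % prime
        r += 1
    return r

def split_as_in_6(m, p, f):
    """Return ({q_i: (e_i, k_i)}, {l_j: (c_j, r_j)}) for N(P) = p^f."""
    norm = p**f
    q_part, l_part = {}, {}
    for prime, exponent in factorize(m).items():
        if prime == p:
            raise ValueError("the sketch assumes p does not divide m")
        if (norm - 1) % prime == 0:
            q_part[prime] = (exponent, valuation(norm - 1, prime))
        else:
            l_part[prime] = (exponent, order_mod(norm, prime))
    return q_part, l_part
```

For example, with $m = 84 = 2^2 \cdot 3 \cdot 7$ and $p^f = 5$, the prime 2 is of type $q$ with $e = 2$, $k = 2$, while 3 and 7 are of type $l$ with $r = 2$ and $r = 6$, so that $R_n = 12$.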

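Lemma 2 Let $x^m - \mu$ be irreducible over $F$, $(m\mu, \mathcal{P}) = 1$, and ${}_F\Psi_m(\mathcal{P}) = \sum g_i[f_i]$; then $f_i \mid mR_n$, where $R_n = \prod_{j=1}^{n} r_j$, for each $i$.

Proof We shall prove this by induction on $n$. Assume that $n = 0$ in (6) above. Then by Theorem 5 in Mann and Velez [2], we have that ${}_F\Psi_m(\mathcal{P}) = \prod_{i=1}^{s} {}_F\Psi_{q_i^{e_i}}(\mathcal{P})$. But, from Rules 1-5 in Mann and Velez [2], the theorem holds for each $q_i^{e_i}$; hence it holds for $m$.

Assume that the assertion is true for $n = k$ and $m = q_1^{e_1} \cdots q_s^{e_s} l_1^{c_1} \cdots l_{k+1}^{c_{k+1}} = m_1 l_{k+1}^{c_{k+1}}$. Let ${}_F\Psi_{m_1}(\mathcal{P}) = \sum g_i[f_i]$; then by induction, $f_i \mid (m_1 \cdot R_k)$, for each $f_i$ that appears in the sum ${}_F\Psi_{m_1}(\mathcal{P})$.

Let $\mathcal{P}_i$ be an ideal factor of $\mathcal{P}$ in $F_1 = F(\mu^{1/m_1})$ of relative degree $f_i$, for each $i$. Then by Theorem 4 in [2], we have that

$${}_F\Psi_m(\mathcal{P}) = \sum g_i[f_i] \cdot {}_{F_1}\Psi_{l_{k+1}^{c_{k+1}}}(\mathcal{P}_i).$$

But if ${}_{F_1}\Psi_{l_{k+1}^{c_{k+1}}}(\mathcal{P}_i) = \sum_j g_{ij}[f_{ij}]$, then $f_{ij} \mid l_{k+1}^{c_{k+1}} \cdot r_{k+1}$, by Rules 1-5 in Mann and Velez [2], for each $j$. Hence $f_i f_{ij} \mid m_1 R_k l_{k+1}^{c_{k+1}} r_{k+1}$. So $f_i f_{ij} \mid mR_{k+1}$. However, $f_i f_{ij}$ is an arbitrary term in ${}_F\Psi_m(\mathcal{P})$, so the induction is complete. ∎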

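Lemma 3 Let $x^m - \mu$ be irreducible over $F$, $(m\mu, \mathcal{P}) = 1$, $m = m_1 l^c$, $(m_1, l) = 1$, $l$ a prime, and ${}_F\Psi_{m_1}(\mathcal{P}) = \sum g_i[f_i]$. If $l \mid p^f - 1$ and $(l, f_i) = 1$, for all $i$, then ${}_F\Psi_m(\mathcal{P}) = {}_F\Psi_{m_1}(\mathcal{P}) \cdot {}_F\Psi_{l^c}(\mathcal{P})$. If $l \nmid p^f - 1$ and $(rl, f_i) = 1$, for all $i$, then ${}_F\Psi_m(\mathcal{P}) = {}_F\Psi_{m_1}(\mathcal{P}) \cdot {}_F\Psi_{l^c}(\mathcal{P})$.

Proof Let $F_1 = F(\mu^{1/m_1})$ and $\mathcal{P}_i$ a prime factor of $\mathcal{P}$ in $F_1$ of relative degree $f_i$, for each $i$. Then by Theorem 4 in Mann and Velez [2], we have that

$${}_F\Psi_m(\mathcal{P}) = \sum g_i[f_i] \cdot {}_{F_1}\Psi_{l^c}(\mathcal{P}_i).$$

But, ${}_{F_1}\Psi_{l^c}(\mathcal{P}_i) = C(l^c, o(\mu^{1/m_1}), p^{ff_i})$. By Lemma 1, if $l^n \,\|\, o(\mu)$, then $l^n \,\|\, o(\mu^{1/m_1})$ since $(m_1, l) = 1$. Hence

$$C(l^c, o(\mu^{1/m_1}), p^{ff_i}) = C(l^c, o(\mu), p^{ff_i}).$$

If $l \mid p^f - 1$ and $l^w \,\|\, p^f - 1$, then $l^w \,\|\, p^{ff_i} - 1$, since $(f_i, l) = 1$.

If $l \nmid p^f - 1$ and $l^w \,\|\, p^{fr} - 1$, then $r$ is the order of $p^{ff_i}$ modulo $l$ and $l^w \,\|\, p^{ff_ir} - 1$, since $(f_i, rl) = 1$.

Hence $C(l^c, o(\mu), p^{ff_i}) = C(l^c, o(\mu), p^f) = {}_F\Psi_{l^c}(\mathcal{P})$. So we have that for each $i$, ${}_{F_1}\Psi_{l^c}(\mathcal{P}_i) = {}_F\Psi_{l^c}(\mathcal{P})$. Hence ${}_F\Psi_m(\mathcal{P}) = {}_F\Psi_{m_1}(\mathcal{P}) \cdot {}_F\Psi_{l^c}(\mathcal{P})$. ∎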

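Theorem 1 Let $x^m - \mu$ be irreducible over $F$, $m = m_1 l^c$, $l$ a prime, $(m\mu, \mathcal{P}) = (m_1, l) = 1$, and $l$ is the largest prime divisor of $m$ such that $l \nmid p^f - 1$, then

$${}_F\Psi_m(\mathcal{P}) = {}_F\Psi_{m_1}(\mathcal{P}) + C(m_1, o(\mu), p^{fr})\bigl({}_F\Psi_{l^c}(\mathcal{P}) - 1[1]\bigr).$$

Proof Let $m = q_1^{e_1} \cdots q_s^{e_s} l_1^{c_1} \cdots l_n^{c_n} = m_1 l^c$, where $l^c = l_n^{c_n}$. Since $r_i \mid l_i - 1$, we have that $r_i < l_i < l$, $i < n$, hence $(l, R_{n-1}) = 1$. Hence $(l, m_1 R_{n-1}) = 1$, where $R_{n-1} = \prod_{i=1}^{n-1} r_i$.

Let $F_1 = F(\mu^{1/l})$, then ${}_F\Psi_l(\mathcal{P}) = 1[1] + ((l-1)/r)[r]$. If $c = 1$, then

$${}_F\Psi_m(\mathcal{P}) = {}_F\Psi_{m_1}(\mathcal{P}) + ((l-1)/r)[r] \cdot C(m_1, o(\mu), p^{fr}) = {}_F\Psi_{m_1}(\mathcal{P}) + C(m_1, o(\mu), p^{fr})\bigl({}_F\Psi_l(\mathcal{P}) - 1[1]\bigr).$$

Hence, the theorem is true for $c = 1$. Assume it is true for $c = k$ and let $m = m_1 l^{k+1}$. Then

$${}_F\Psi_m(\mathcal{P}) = {}_F\Psi_{m_1 l^k}(\mathcal{P}) + ((l-1)/r)[r] \cdot C(m_1 l^k, l \cdot o(\mu), p^{fr}).$$

By Lemma 2, if $C(m_1, l \cdot o(\mu), p^{fr}) = \sum g_i[f_i]$, then $f_i \mid m_1 R_{n-1}$. Hence $(f_i, l) = 1$, so by Lemma 3 we have that

$$C(m_1 l^k, l \cdot o(\mu), p^{fr}) = C(m_1, o(\mu), p^{fr})\,C(l^k, l \cdot o(\mu), p^{fr}).$$

Hence

$$\begin{aligned}
{}_F\Psi_m(\mathcal{P}) &= {}_F\Psi_{m_1}(\mathcal{P}) + C(m_1, o(\mu), p^{fr})\bigl({}_F\Psi_{l^k}(\mathcal{P}) - 1[1]\bigr) \\
&\quad + ((l-1)/r)[r]\,C(m_1, o(\mu), p^{fr})\,C(l^k, l \cdot o(\mu), p^{fr}) \\
&= {}_F\Psi_{m_1}(\mathcal{P}) + C(m_1, o(\mu), p^{fr}) \cdot \bigl({}_F\Psi_{l^k}(\mathcal{P}) + ((l-1)/r)[r]\,C(l^k, l \cdot o(\mu), p^{fr}) - 1[1]\bigr) \\
&= {}_F\Psi_{m_1}(\mathcal{P}) + C(m_1, o(\mu), p^{fr})\bigl({}_F\Psi_{l^{k+1}}(\mathcal{P}) - 1[1]\bigr),
\end{aligned}$$

by Lemma 3 of [2]. So the induction is complete. ∎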

Theorem 1 gives a recursive procedure for writing ${}_F\Psi_m(\mathcal{P})$ in terms of the counting functions of the divisors of $m$. That is, if $m = q_1^{e_1} \cdots q_s^{e_s} l_1^{c_1} \cdots l_n^{c_n} = m_1 l_n^{c_n}$, as in (6), where $l_1 < l_2 < \cdots < l_n$, then by Theorem 1

$${}_F\Psi_m(\mathcal{P}) = {}_F\Psi_{m_1}(\mathcal{P}) + C(m_1, o(\mu), p^{fr_n})\bigl({}_F\Psi_{l_n^{c_n}}(\mathcal{P}) - 1[1]\bigr).$$

Note, however, that ${}_F\Psi_{m_1}(\mathcal{P})$ has exactly $n - 1$ primes dividing $m_1$ and not dividing $p^f - 1$, $C(m_1, o(\mu), p^{fr_n})$ has no more than $n - 1$ primes dividing $m_1$ and not dividing $p^{fr_n} - 1$, while ${}_F\Psi_m(\mathcal{P})$ has exactly $n$ primes dividing $m$ and not dividing $p^f - 1$. Hence, this procedure can now be applied to ${}_F\Psi_{m_1}(\mathcal{P})$ and $C(m_1, o(\mu), p^{fr_n})$.

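Example 1 If $n = 1$, then

$${}_F\Psi_m(\mathcal{P}) = {}_F\Psi_{m_1}(\mathcal{P}) + C(m_1, o(\mu), p^{fr_1})\bigl({}_F\Psi_{l_1^{c_1}}(\mathcal{P}) - 1[1]\bigr).$$

However, by Theorem 5 in Mann and Velez [2], we have that

$${}_F\Psi_m(\mathcal{P}) = \prod_{i=1}^{s} {}_F\Psi_{q_i^{e_i}}(\mathcal{P}) + \Bigl(\prod_{i=1}^{s} C(q_i^{e_i}, o(\mu), p^{fr_1})\Bigr)\bigl({}_F\Psi_{l_1^{c_1}}(\mathcal{P}) - 1[1]\bigr).$$

So, ${}_F\Psi_m(\mathcal{P})$ is expressed in terms of the known counting functions of its prime divisors.

Example 2 If $m = q_1^{e_1} \cdots q_s^{e_s} l_1^{c_1} l_2^{c_2} = m_1 l_1^{c_1} l_2^{c_2}$, $l_1 < l_2$, then

$$\begin{aligned}
{}_F\Psi_m(\mathcal{P}) &= {}_F\Psi_{m_1 l_1^{c_1}}(\mathcal{P}) + C(m_1 l_1^{c_1}, o(\mu), p^{fr_2})\bigl({}_F\Psi_{l_2^{c_2}}(\mathcal{P}) - 1[1]\bigr) \\
&= {}_F\Psi_{m_1}(\mathcal{P}) + C(m_1, o(\mu), p^{fr_1})\bigl({}_F\Psi_{l_1^{c_1}}(\mathcal{P}) - 1[1]\bigr) \\
&\quad + C(m_1 l_1^{c_1}, o(\mu), p^{fr_2})\bigl({}_F\Psi_{l_2^{c_2}}(\mathcal{P}) - 1[1]\bigr).
\end{aligned}$$

Now, the first two terms can be factored as in Example 1. We consider two cases: (1) $l_1 \mid p^{fr_2} - 1$, (2) $l_1 \nmid p^{fr_2} - 1$ and the order of $p^{fr_2}$ modulo $l_1$ is $r_1'$. Then in case 1,

$$C(m_1 l_1^{c_1}, o(\mu), p^{fr_2}) = \prod_{i=1}^{s} C(q_i^{e_i}, o(\mu), p^{fr_2}) \cdot C(l_1^{c_1}, o(\mu), p^{fr_2}),$$

by Theorem 5 in Mann and Velez [2].

In case 2, we must apply Theorem 1 and we have

$$C(m_1 l_1^{c_1}, o(\mu), p^{fr_2}) = C(m_1, o(\mu), p^{fr_2}) + C(m_1, o(\mu), p^{fr_2r_1'})\bigl(C(l_1^{c_1}, o(\mu), p^{fr_2}) - 1[1]\bigr). \ ∎$$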

Lemma 4 Let $x^{l^c} - \mu$ be irreducible over $F$, $(l\mu, \mathcal{P}) = 1$, $l^k \,\|\, p^f - 1$, $k \ge 1$, and $l^n = (o(\mu), l^k)$, then ${}_F\Psi_{l^c}(\mathcal{P}) = l^c[1]$ iff $c \le k - n$.

Proof The lemma is proved by referring to Rules 1-4 in Mann and Velez [2]. If Rule 1 applies, then

$${}_F\Psi_{l^c}(\mathcal{P}) = l^{\min\{c,\,k-n\}}\bigl[l^{\max\{0,\,c-k+n\}}\bigr] = l^c[1] \quad \text{iff } c \le k - n.$$

Rule 2 cannot apply here since $k = 1 = n$. If Rules 3 or 4 apply, then $n = 0$ and the assertion is obvious. ∎
Lemma 5 If $\lambda$ is an integer in $F$, $(\lambda, \mathcal{P}) = 1$, then for each $m$ there is an irreducible polynomial $x^m - \mu$ such that $\mu \equiv \lambda \pmod{\mathcal{P}}$.

Proof Let $\mathcal{P}'$ be a prime ideal in $F$ such that $(\mathcal{P}', \mathcal{P}) = 1$. Then, we can find an ideal $\mathcal{A}$ such that $(\mathcal{A}, \mathcal{P}\mathcal{P}') = 1$ and $\mathcal{P}'\mathcal{A} = (\pi)$ (see Mann [1, p. 50]). Hence $\mathcal{P}' \,\|\, \pi$. Since $(\mathcal{P}', \mathcal{P}) = 1$, we can solve the system of congruences $x \equiv \lambda/\pi \pmod{\mathcal{P}}$, $x \equiv 1 \pmod{\mathcal{P}'}$. Let $\beta$ be such a solution and set $\mu = \beta\pi$. Then $\mu \equiv \lambda \pmod{\mathcal{P}}$. Furthermore, since $\mathcal{P}' \,\|\, \mu$, we have that $x^m - \mu$ is an Eisenstein polynomial, and hence, $x^m - \mu$ is irreducible for all $m$. ∎

In Lemma 3 we had the situation where ${}_F\Psi_{m_1 l^c}(\mathcal{P}) = {}_F\Psi_{m_1}(\mathcal{P}) \cdot {}_F\Psi_{l^c}(\mathcal{P})$. The next theorem, in fact, characterizes this phenomenon.

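Theorem 2 Let $m = m_1 l^c$, $(m_1, l) = 1$, $x^m - \mu$ irreducible over $F$, then ${}_F\Psi_m(\mathcal{P}) = {}_F\Psi_{m_1}(\mathcal{P}) \cdot {}_F\Psi_{l^c}(\mathcal{P})$, for all such $m$, iff $l \mid p^f - 1$ and $c \le k - n$, where $k$ and $n$ are as in Lemma 4.

Proof Assume first that $l \mid p^f - 1$ and $c \le k - n$. Let $F_1 = F(\mu^{1/l^c})$, then by Lemma 4, we have that ${}_F\Psi_{l^c}(\mathcal{P}) = l^c[1]$. Hence, if $\mathcal{P}_1$ is any factor of $\mathcal{P}$ in $F_1$, then

$${}_F\Psi_m(\mathcal{P}) = l^c\,{}_{F_1}\Psi_{m_1}(\mathcal{P}_1) = l^c\,C(m_1, o(\mu^{1/l^c}), p^f) = l^c\,C(m_1, o(\mu), p^f),$$

since $(l, m_1) = 1$. Hence ${}_F\Psi_m(\mathcal{P}) = l^c\,{}_F\Psi_{m_1}(\mathcal{P}) = {}_F\Psi_{l^c}(\mathcal{P})\,{}_F\Psi_{m_1}(\mathcal{P})$.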

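Next, assume that ${}_F\Psi_m(\mathcal{P}) = {}_F\Psi_{m_1}(\mathcal{P}) \cdot {}_F\Psi_{l^c}(\mathcal{P})$, for all $m_1$, $(m_1, l) = 1$.

Assume first that $l \nmid p^f - 1$ and let $r$ be the order of $p^f$ modulo $l$. Let $l_1$ be a prime such that $l_1 \nmid p^f - 1$ and $(r_1, r) \ne 1$, where $r_1$ is the order of $p^f$ modulo $l_1$. By assumption,

$${}_F\Psi_{l^c l_1}(\mathcal{P}) = {}_F\Psi_{l^c}(\mathcal{P}) \cdot {}_F\Psi_{l_1}(\mathcal{P}) = {}_F\Psi_{l^c}(\mathcal{P})\bigl(1[1] + ((l_1 - 1)/r_1)[r_1]\bigr).$$

Now, let $F_1 = F(\mu^{1/l_1})$ and $\mathcal{P}_0$ a prime factor of $\mathcal{P}$ in $F_1$ of degree 1, $\mathcal{P}_1$ a prime factor of $\mathcal{P}$ in $F_1$ of degree $r_1$, then

$$\begin{aligned}
{}_F\Psi_{l^c l_1}(\mathcal{P}) &= {}_{F_1}\Psi_{l^c}(\mathcal{P}_0) + ((l_1 - 1)/r_1)[r_1] \cdot C(l^c, o(\mu), p^{fr_1}) \\
&= {}_F\Psi_{l^c}(\mathcal{P}) + ((l_1 - 1)/r_1)[r_1]\,C(l^c, o(\mu), p^{fr_1}) \\
&= {}_F\Psi_{l^c}(\mathcal{P}) \cdot 1[1] + ((l_1 - 1)/r_1)[r_1] \cdot {}_F\Psi_{l^c}(\mathcal{P}).
\end{aligned}$$

Hence, ${}_F\Psi_{l^c}(\mathcal{P}) = C(l^c, o(\mu), p^{fr_1})$, but this is impossible since $(r, r_1) \ne 1$. So, if ${}_F\Psi_m(\mathcal{P}) = {}_F\Psi_{m_1}(\mathcal{P}) \cdot {}_F\Psi_{l^c}(\mathcal{P})$ for all $m_1$ prime to $l$, then $l \mid p^f - 1$.

Since $l \mid p^f - 1$, let $l^k \,\|\, p^f - 1$ and choose $k_1$ so that $k_1 > k$ and $c \le k_1 - n$. Set $K = k_1 - k$. Let $l_1$ be a prime so that $(l_1, p^f - 1) = 1$ and $l^K \mid r_1$, where $r_1$ is the order of $p^f$ modulo $l_1$. By Lemma 5, we know that there exists an irreducible polynomial $x^{l^c l_1} - \lambda$, where $\lambda \equiv \mu \pmod{\mathcal{P}}$. So $o(\lambda) = o(\mu)$. Then

$${}_F\Psi_{l^c l_1}(\mathcal{P}) = {}_F\Psi_{l^c}(\mathcal{P}) + ((l_1 - 1)/r_1)[r_1]\,C(l^c, o(\mu), p^{fr_1}).$$

Since $l^K \mid r_1$ and $l^k \,\|\, p^f - 1$, we have that $l^{k_1} \mid p^{fr_1} - 1$. But $c \le k_1 - n$, hence by Lemma 4, we have that $C(l^c, o(\mu), p^{fr_1}) = l^c[1]$. Hence

$$\begin{aligned}
{}_F\Psi_{l^c l_1}(\mathcal{P}) &= {}_F\Psi_{l^c}(\mathcal{P}) + ((l_1 - 1)/r_1)[r_1] \cdot l^c[1] \\
&= {}_F\Psi_{l^c}(\mathcal{P}) \cdot {}_F\Psi_{l_1}(\mathcal{P}) = {}_F\Psi_{l^c}(\mathcal{P}) + ((l_1 - 1)/r_1)[r_1] \cdot {}_F\Psi_{l^c}(\mathcal{P}),
\end{aligned}$$

so ${}_F\Psi_{l^c}(\mathcal{P}) = l^c[1]$, hence $c \le k - n$, by Lemma 4, and the theorem is proven. ∎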

Theorem 3 Let $x^m - \mu$ be irreducible over $F$ and $m$ is written as in (6), then

$${}_F\Psi_m(\mathcal{P}) = \prod_{i=1}^{s} {}_F\Psi_{q_i^{e_i}}(\mathcal{P}) \prod_{j=1}^{n} {}_F\Psi_{l_j^{c_j}}(\mathcal{P})$$

if (a) $(r_i, r_j) = (r_i, l_j) = 1$, and (b) for each $i$, either $(q_j, r_i) = 1$ or $e_j \le k_j - n_j$, for each $j$, where $q_j^{n_j} \,\|\, o(\mu)$ modulo $\mathcal{P}$.

Proof Assume that $(r_i, r_j) = (r_i, l_j) = 1$ and $(q_j, r_i) = 1$ or $e_j \le k_j - n_j$. We shall induct on the number of prime factors of $m$. If $m$ is a power of a prime, then we are done. Hence assume that $m$ has $k + 1$ distinct prime factors and the theorem is true for $k$ prime factors. If $e_j \le k_j - n_j$, then by Lemma 4 we have that ${}_F\Psi_m(\mathcal{P}) = {}_F\Psi_{m_1}(\mathcal{P}) \cdot {}_F\Psi_{q_j^{e_j}}(\mathcal{P})$, where $m_1 = m/q_j^{e_j}$. Then induction applies to ${}_F\Psi_{m_1}(\mathcal{P})$ and the theorem is proven. Hence, we may assume that $e_j > k_j - n_j$, for all $j$. Then we must have that for each $i$ and $j$, $(r_i, q_j) = 1$. If $n = 0$, then we are done by Theorem 5 in Mann and Velez [2]. Hence, assume that $n \ne 0$. Without loss of generality, we may assume that $l_n > l_i$, $i < n$. Then, by Theorem 1, we have that

$${}_F\Psi_m(\mathcal{P}) = {}_F\Psi_{m_1}(\mathcal{P}) + C(m_1, o(\mu), p^{fr_n})\bigl({}_F\Psi_{l_n^{c_n}}(\mathcal{P}) - 1[1]\bigr),$$

where $m_1 = m/l_n^{c_n}$. But $(r_n, r_i l_i) = 1 = (r_n, q_j)$, for all $j$, $i < n$. Hence, by (5), we have that

$$C(m_1, o(\mu), p^{fr_n}) = C(m_1, o(\mu), p^f) = {}_F\Psi_{m_1}(\mathcal{P}),$$

so ${}_F\Psi_m(\mathcal{P}) = {}_F\Psi_{m_1}(\mathcal{P}) \cdot {}_F\Psi_{l_n^{c_n}}(\mathcal{P})$, so, we can apply the induction hypothesis to ${}_F\Psi_{m_1}(\mathcal{P})$ and the induction is complete. ∎

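Conjecture We conjecture that if

$${}_F\Psi_m(\mathcal{P}) = \prod_{i=1}^{s} {}_F\Psi_{q_i^{e_i}}(\mathcal{P}) \prod_{j=1}^{n} {}_F\Psi_{l_j^{c_j}}(\mathcal{P}),$$

then the numerical conditions (a) and (b) in Theorem 3 hold, that is, that Theorem 3 is actually a characterization of this phenomenon.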
Orthogonal designs were introduced as a generalization of Hadamard arrays and their existence has been studied by several authors. This work has led to some intricate and elaborate methods and results, but, to date, no general method has been developed. In this paper we claim to isolate the algebraic properties of orthogonal designs and show that the existence of these designs is dependent upon the equivalence of quadratic forms. We give strong necessary conditions for the existence of orthogonal designs in orders $2^r \cdot n_0$, $n_0$ odd, $r = 0, 1, 2$, or $3$.


Definition An orthogonal design in order $n$ and of type $(u_1, \ldots, u_s)$ on the commuting variables $x_1, \ldots, x_s$ is an $n \times n$ matrix $X$ with entries from $\{0, \pm x_1, \ldots, \pm x_s\}$ such that $XX^t = \bigl(\sum_{i=1}^{s} u_i x_i^2\bigr)I_n$.

Alternatively, each row of $X$ contains $u_i$ entries of the type $\pm x_i$ and the rows are formally orthogonal. Such matrices were first described in Geramita et al. [2] and have been very useful for determining the existence of weighing matrices and, in particular, Hadamard matrices. (See Geramita et al. [2], Geramita and Wallis [3] or Geramita and Wallis [3a].)

† This paper is based on the author's doctoral dissertation written at Queen's University, Kingston, Ontario, Canada, under the supervision of Dr. A. V. Geramita.

‡ Present address: Department of Mathematics, Royal Roads Military College, Victoria, British Columbia, Canada.

If $X$ is as above, we may write $X = \sum_{i=1}^{s} A_i x_i$ where the $A_i$ are the coefficient matrices. Then the $A_i$ are $0, \pm 1$ matrices of order $n$ such that:

(i) the $A_i$ are disjoint, i.e., if $A_i = (a_{kl})$, $A_j = (b_{kl})$ $(i \ne j)$, then $a_{kl}b_{kl} = 0$ for every $k$, $l$;

(ii) $A_iA_i^t = u_iI_n$, $1 \le i \le s$;

(iii) $A_iA_j^t + A_jA_i^t = 0$, $i \ne j$.

It has been demonstrated that each of the above properties imposes a strong criterion for the existence of an orthogonal design. In this paper, however, we shall examine only the algebraic properties, namely (ii) and (iii). For this purpose we make the following definition:

Definition A rational family in order $n$ and of type $[u_1, \ldots, u_s]$ is a family of $n \times n$ rational matrices $\{A_1, \ldots, A_s\}$ such that (ii) and (iii) above are satisfied.

Surely a rational family reflects the algebraic properties of an orthogonal 
design, and the existence of an orthogonal design will imply the existence of 
a rational family in the same order and of the same type. 

If $\{A_1, \ldots, A_s\}$ is a rational family as above, then:

(i) $\{a_iA_i,\ 1 \le i \le s\}$, where $a_i \in Q$, is a rational family in order $n$ and of type $[a_1^2u_1, \ldots, a_s^2u_s]$;

(ii) $\{A_1 + A_2, A_3, \ldots, A_s\}$ is a rational family in order $n$ and of type $[u_1 + u_2, u_3, \ldots, u_s]$;

(iii) if $P$ and $Q$ are $n \times n$ rational matrices such that $PP^t = pI_n$ and $QQ^t = qI_n$, then $\{PA_iQ,\ 1 \le i \le s\}$ is a rational family in order $n$ and of type $[pu_1q, \ldots, pu_sq]$.

Thus, one may "translate" a rational family in the same order and so change the type numbers to a certain extent.
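Conditions (ii) and (iii) and the first two "translation" rules can be verified mechanically on a small example. The sketch below is an illustration, not the construction used in the cited papers: it takes as an assumed starting point four $4 \times 4$ signed permutation matrices of quaternion type, and the helper names are hypothetical.

```python
from fractions import Fraction

def matmul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def transpose(A):
    return [list(col) for col in zip(*A)]

def add(A, B):
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def scale(c, A):
    return [[c * a for a in row] for row in A]

def family_type(mats):
    """Return [u_1, ..., u_s] if mats satisfies (ii) and (iii), else None."""
    n = len(mats[0])
    types = []
    for A in mats:
        G = matmul(A, transpose(A))
        u = G[0][0]
        if any(G[i][j] != (u if i == j else 0) for i in range(n) for j in range(n)):
            return None  # condition (ii) fails
        types.append(u)
    for i, A in enumerate(mats):
        for B in mats[i + 1:]:
            S = add(matmul(A, transpose(B)), matmul(B, transpose(A)))
            if any(x != 0 for row in S for x in row):
                return None  # condition (iii) fails
    return types

# An assumed example family in order 4 (quaternion-type signed permutations).
A1 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]
A2 = [[0, 1, 0, 0], [-1, 0, 0, 0], [0, 0, 0, -1], [0, 0, 1, 0]]
A3 = [[0, 0, 1, 0], [0, 0, 0, 1], [-1, 0, 0, 0], [0, -1, 0, 0]]
A4 = [[0, 0, 0, 1], [0, 0, -1, 0], [0, 1, 0, 0], [-1, 0, 0, 0]]
base = [A1, A2, A3, A4]

# Rule (i): rescale by rationals a_i; rule (ii): merge the first two members.
coeffs = [Fraction(2), Fraction(1, 2), Fraction(3), Fraction(1)]
scaled = [scale(c, A) for c, A in zip(coeffs, base)]
merged = [add(A1, A2), A3, A4]
```

The base family has type $[1, 1, 1, 1]$, the rescaled one has type $[4, 1/4, 9, 1]$, and the merged one has type $[2, 1, 1]$, in line with rules (i) and (ii).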

The first theorem for rational families limits the number of matrices in such a family and was first noted in Geramita and Pullman [1] and later described in terms of orthogonal designs.

Theorem 1 (Radon-Geramita-Pullman) There exists a rational family in order $n$ with $s$ members if and only if $s \le \rho(n)$ (the Radon function), where, if $n = 2^{4a+b} \cdot n_0$, $n_0$ odd, $0 \le b < 4$, then $\rho(n) = 8a + 2^b$.

Necessity in the above theorem is proved using Radon's result on the maximal number of anticommuting, skew-symmetric, orthogonal real matrices in order $n$ (see Radon [6]). Sufficiency is given by Geramita and Pullman [1], who have shown that, for a given $n$, there exists a rational family in order $n$ with $\rho(n)$ members and of type $[1, 1, \ldots, 1]$. In fact, the matrices in their rational families are disjoint $0, \pm 1$ matrices.

The Radon function is easily computable; for example, $\rho(2n_0) = 2$, $\rho(4n_0) = 4$, $\rho(8n_0) = 8$, $\rho(16n_0) = 9$, ..., where $n_0$ is odd.
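The formula in Theorem 1 translates directly into code. A minimal sketch (the function name `radon` is a label chosen here):

```python
def radon(n):
    """Radon function: for n = 2^(4a+b) * n0, n0 odd, 0 <= b < 4, return 8a + 2^b."""
    if n <= 0:
        raise ValueError("n must be a positive integer")
    exponent = 0
    while n % 2 == 0:
        n //= 2
        exponent += 1
    a, b = divmod(exponent, 4)
    return 8 * a + 2**b
```

Evaluating it on $2n_0$, $4n_0$, $8n_0$, $16n_0$ for any odd $n_0$ reproduces the values 2, 4, 8, 9 listed above.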

Further existence theorems for orthogonal designs have been found involving type numbers. In certain orders, Geramita and Wallis have shown that these type numbers were closely related to sums of squares (see Geramita et al. [2] and Geramita and Wallis [3]). We shall show that, more precisely, rational families (and hence orthogonal designs) rely on the equivalence of rational quadratic forms. We refer the reader to Serre [7] for particulars and shall establish notation and definitions.

Let $f$ be an $n$-dimensional quadratic form over $Q$, and let $M_f$ be the uniquely determined symmetric $n \times n$ rational matrix such that $f(X) = X^tM_fX$.

Two quadratic forms $f$ and $g$ are said to be equivalent if there exists an invertible matrix $P$ such that $M_g = PM_fP^t$. We write $f \simeq g$ to denote this equivalence. Every quadratic form $f$ is equivalent to a diagonal form, i.e., there exists an invertible matrix $P$ such that $PM_fP^t$ is the diagonal matrix $\mathrm{dg}(a_1, \ldots, a_n)$ for some $a_i \in Q$. We denote this diagonal form as $\langle a_1, \ldots, a_n \rangle$. Let $n \cdot \langle 1 \rangle$ denote the form given by the identity matrix $I_n$.

If $f = \langle a_1, \ldots, a_n \rangle$, we let $d(f)$ be the discriminant of $f$, i.e., the image of $\prod_{i=1}^{n} a_i$ in the group $Q^*/Q^{*2}$.

Let $p$ be a prime and let $Q_p$ denote the field of $p$-adic numbers. If $a$, $b$ are in $Q_p$, the $p$-adic Hilbert symbol of $a$ and $b$ is (note $a$, $b$ nonzero)

$$(a, b)_p = \begin{cases} 1 & \text{if } \exists\, x, y, z \text{ in } Q_p \text{ (not all zero) such that } ax^2 + by^2 = z^2, \\ -1 & \text{otherwise.} \end{cases}$$

If $f = \langle a_1, \ldots, a_n \rangle$ is a rational quadratic form, then the $p$-adic Hasse invariant of $f$ is $s_p(f) = \prod_{i<j} (a_i, a_j)_p$.
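For nonzero integers the Hilbert symbol can be evaluated without any $p$-adic arithmetic, using the standard explicit formula in terms of Legendre symbols (for odd $p$) and residues modulo 8 (for $p = 2$). The sketch below assumes integer inputs; a rational number can be replaced by an integer in the same square class first. The function names are hypothetical.

```python
def _split(a, p):
    """Write the nonzero integer a as p^alpha * u with p not dividing u."""
    alpha = 0
    while a % p == 0:
        a //= p
        alpha += 1
    return alpha, a

def _legendre(u, p):
    """Legendre symbol (u/p) for an odd prime p not dividing u."""
    return -1 if pow(u % p, (p - 1) // 2, p) == p - 1 else 1

def hilbert(a, b, p):
    """Hilbert symbol (a, b)_p for nonzero integers a, b and a prime p."""
    if a == 0 or b == 0:
        raise ValueError("a and b must be nonzero")
    alpha, u = _split(a, p)
    beta, v = _split(b, p)
    if p == 2:
        eps = lambda x: ((x - 1) // 2) % 2        # 0 if x = 1 mod 4, 1 if x = 3 mod 4
        omega = lambda x: ((x * x - 1) // 8) % 2  # 0 if x = +-1 mod 8, 1 if x = +-3 mod 8
        exponent = eps(u) * eps(v) + alpha * omega(v) + beta * omega(u)
        return -1 if exponent % 2 else 1
    result = -1 if (alpha * beta * ((p - 1) // 2)) % 2 else 1
    if beta % 2:
        result *= _legendre(u, p)
    if alpha % 2:
        result *= _legendre(v, p)
    return result

def hasse(diagonal, p):
    """Hasse invariant s_p of the diagonal form <a_1, ..., a_n>."""
    s = 1
    for i in range(len(diagonal)):
        for j in range(i + 1, len(diagonal)):
            s *= hilbert(diagonal[i], diagonal[j], p)
    return s
```

As a usage example, `hilbert(2, 3, 2)` and `hilbert(2, 3, 3)` are both $-1$, consistent with the product formula since $(2, 3)_\infty = 1$, while `hasse([1, 1, 1, 1], p)` is 1 at every prime.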

We can now state the main theorem on the classification of rational quadratic forms.

Theorem 2 (weak Hasse-Minkowski) $f$ and $g$ are equivalent rational quadratic forms if and only if they have the same dimension, the same discriminant, the same Hasse invariant at each prime, and the same number of positive terms in equivalent diagonal forms.

Theorem 3 If $n = 1, 2, 4$, or $8$, then there exists a rational family in order $n$ and of type $[u_1, \ldots, u_n]$ if and only if $n \cdot \langle 1 \rangle \simeq \langle u_1, \ldots, u_n \rangle$.

Proof If $\{A_i,\ 1 \le i \le n\}$ is a rational family in order $n$ and of type $[u_1, \ldots, u_n]$, let $v_i$ be the vector in $Q^n$ that corresponds to the first row of $A_i$. Then, by the properties of a rational family, the $v_i$ form an orthogonal basis for $Q^n$ where the norm of $v_i$ (in the usual Euclidean sense) is $u_i$. Let $P$ be the $n \times n$ matrix whose $i$th row is $v_i$. Then $PP^t = \mathrm{dg}(u_1, \ldots, u_n)$ and, consequently, $n \cdot \langle 1 \rangle \simeq \langle u_1, \ldots, u_n \rangle$.

Conversely, if $n \cdot \langle 1 \rangle \simeq \langle u_1, \ldots, u_n \rangle$, then there exists an invertible $n \times n$ matrix $P = (p_{ij})$ such that $PI_nP^t = \mathrm{dg}(u_1, \ldots, u_n)$. Let $\{B_i,\ 1 \le i \le n\}$ be the Geramita-Pullman rational family in order $n$ and of type $[1, \ldots, 1]$. (Note that in Theorem 1, $\rho(n) = n$ if and only if $n = 1, 2, 4$, or $8$.) (See Taussky [9] for a discussion of the 1, 2, 4, or 8 problem.)

Let $A_i = \sum_{j=1}^{n} p_{ij}B_j$. Then it is a straightforward verification to show that $\{A_i,\ 1 \le i \le n\}$ is a rational family in order $n$ and of type $[u_1, \ldots, u_n]$.
The force of the above theorem is realized when one uses the following reduction theorem proved by Shapiro using his classification of similarities of quadratic forms.

Theorem 4 (Shapiro, [8]) If $n = 2^m \cdot n_0$, $n_0$ odd, then there exists a rational family in order $n$ if and only if there exists a rational family in order $2^m$ of the same type.

In [8], Shapiro proves the above theorem for $m = 1, 2$, or $3$. In a recent private communication, he has proved the general result.

We may now apply the above theorems to some particular cases of 
rational families. 

Theorem 5 (i) If $n$ is odd, then there exists a rational family in order $n$ and of type $[a]$ if and only if $a$ is a square.

(ii) If $n \equiv 2 \pmod 4$, then there exists a rational family in order $n$ and of type $[a, b]$ if and only if $ab$ is a square and $a$ and $b$ are both sums of two rational squares.

We note that the above results were already known in the context of weighing matrices and orthogonal designs (see Geramita et al. [2]).

Proof (i) follows directly by applying Theorems 3 and 4 and equating the discriminants of the forms $\langle a \rangle$ and $\langle 1 \rangle$.

(ii) Again by Theorems 3 and 4, we need only establish when $\langle a, b \rangle \simeq \langle 1, 1 \rangle$. Using Hasse-Minkowski and some standard Hilbert symbol arguments, one can show that this equivalence exists if and only if the stated conditions on $a$ and $b$ hold.

Theorem 6 If $n \equiv 4 \pmod 8$, then there exists a rational family in order $n$ and of type:

(i) $[a, b, c, d]$ if and only if $abcd$ is a square and, at every prime $p$, $s_p(\langle a, b, c, d \rangle) = 1$;

(ii) $[a, b, c]$ if and only if at every prime $p$, $s_p(\langle a, b, c \rangle)(-1, abc)_p = 1$;

(iii) $[a, b]$ if and only if $ab$ is a sum of three squares;

(iv) $[a]$ always.

Proof (i) follows directly from Theorems 2-4.

(ii) By Theorem 4, we may reduce to order 4. We have seen that the existence of the rational family is equivalent to the existence of a $4 \times 4$ matrix $P$ such that $PP^t = \mathrm{dg}(a, b, c, x)$ for some $x \in Q$. In terms of quadratic forms, $4 \cdot \langle 1 \rangle \simeq \langle a, b, c, x \rangle$. By equating discriminants, one sees that $x$ must be $abc$, and the condition of (ii) will follow by equating the Hasse invariants.

(iii) was originally shown by Geramita and Wallis using a Pfaffian argument [3]. However, as in (ii), one may see that the existence of the rational family is equivalent to the existence of a rational number $x$ such that $4 \cdot \langle 1 \rangle \simeq \langle a, b, x, abx \rangle$. Using standard arguments, one can show that such an $x$ exists if and only if $ab$ is a sum of three squares.

(iv) follows since, for any positive $a$, $4 \cdot \langle 1 \rangle \simeq \langle a, a, a, a \rangle$.
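Condition (iii) of Theorem 6 is easy to test when $a$ and $b$ are positive integers. The sketch below relies on two assumptions that are not stated in the text: Legendre's three-square theorem (a positive integer is a sum of three integer squares exactly when it is not of the form $4^k(8m + 7)$), and the classical fact that an integer which is a sum of three rational squares is also a sum of three integer squares. The function names are hypothetical.

```python
def is_sum_of_three_squares(n):
    """Legendre's criterion: n >= 0 is a sum of three squares iff n != 4^k (8m + 7)."""
    if n < 0:
        return False
    if n == 0:
        return True
    while n % 4 == 0:
        n //= 4
    return n % 8 != 7

def type_ab_passes(a, b):
    """Test condition (iii) of Theorem 6 for positive integers a, b."""
    return is_sum_of_three_squares(a * b)
```

For example, the pair $(2, 7)$ passes since $14 = 9 + 4 + 1$, while $(3, 5)$ and $(1, 7)$ fail because $15$ and $7$ are of the excluded form.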

Theorem 7 If $n \equiv 8 \pmod{16}$, then there exists a rational family in order $n$ and:

(i) with five or fewer members always;

(ii) of type $[a_1, \ldots, a_8]$ if and only if $\prod_{i=1}^{8} a_i$ is a square and, at every prime $p$, $s_p(\langle a_1, \ldots, a_8 \rangle) = 1$;

(iii) of type $[a_1, \ldots, a_7]$ if and only if at every prime $p$, $s_p(\langle a_1, \ldots, a_7 \rangle)\bigl(-1, \prod_{i=1}^{7} a_i\bigr)_p = 1$;

(iv) of type $[a_1, \ldots, a_6]$ where $a_1a_2a_3a_4$ is a square and $s_p(\langle a_1, \ldots, a_4 \rangle) = 1$ if and only if $a_5a_6$ is a sum of three squares.

Proof Clearly (ii) and (iii) are the analogues of (i) and (ii) in Theorem 6 and are proven similarly.

In (iv), the additional hypothesis on $a_1, \ldots, a_4$ merely states that a rational family of type $[a_1, \ldots, a_4]$ exists in order 4. If $a_5a_6$ is a sum of three squares, then a rational family of type $[a_5, a_6]$ exists in order 4. By tensoring the members of one family with $I_2$ and of the other with $\begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}$, one then constructs the required rational family in order 8. The converse argument is similar to that used for (iii) in Theorem 6.

(i) requires a slightly different technique and we thank Shapiro for sharing a critical observation. If $a$, $b$, $c$, $d$, and $e$ are any positive rational numbers, then let $\delta = \text{discriminant}\langle a, b, c, d, e \rangle$. Since every positive rational number is represented by $f = \langle a, b, c, d, e \rangle$ [7, p. 37], $\delta$ is represented by $f$ and, hence, $f \simeq \langle \delta, x, y, z, xyz \rangle$ for some $x$, $y$, $z$ in $Q$ [7, p. 33]. A routine check will show that $8 \cdot \langle 1 \rangle \simeq \langle \delta, x, y, z, xyz, \delta xy, \delta xz, \delta yz \rangle$ and, hence, for some quadratic form $g$, $8 \cdot \langle 1 \rangle \simeq f \perp g$. The conclusion of (i) now follows.

The only question left unanswered in orders $8 \cdot n_0$, where $n_0$ is odd, is what happens in (iv) of Theorem 7 if one removes the additional hypothesis. That is, if $a_1, \ldots, a_6$ are positive rational numbers of which no four satisfy completely the added hypothesis, does there exist a rational family in order 8 and of type $[a_1, \ldots, a_6]$?

Also of interest are the orders divisible by $2^r$, where $r > 3$. In personal communication, Shapiro indicates that he has proved the following result:

Theorem 8 (Shapiro) If $n \equiv 16 \pmod{32}$, then there exists a rational family in order $n$ and of type $[a_1, \ldots, a_9]$ if and only if at every prime $p$, $s_p(\langle a_1, \ldots, a_9 \rangle) = 1$.

This result is the first of its kind in orders $16 \cdot n_0$, if $n_0$ is odd, and will eliminate many 9-tuples as possible types of rational families in these orders. Shapiro has shown that a relationship between rational families and quadratic forms does indeed exist in every order. His result now gives a complete solution of the problem of existence of rational families in every order. (We indicate, in an appendix, the orthogonal designs in order 16 on nine variables whose existence is still in doubt.)

We have noted that an orthogonal design gives rise to a rational family 
in the same order and of the same type. Hence all necessary conditions for 
the existence of rational families are necessary conditions for the existence of 
orthogonal designs. In the appendices to this paper we give some indication 
of how these results help determine the existence of orthogonal designs in 
some particular orders. 

The sufficiency of the conditions above for rational families would indicate
that our approach has exhausted the algebraic properties of orthogonal
designs. However, there are also combinatorial results that prohibit the
existence of orthogonal designs of certain types and orders, and we refer the
reader to a sequence of papers which have recently appeared by Geramita,
Wallis, et al. For the purpose of this paper, we shall note one such result and
show how it may be blended with the above theorems to achieve a more
complete picture.

Theorem 9 (Geramita-Verner; see [4]) If $n \equiv 0 \pmod 4$, then there
exists an orthogonal design in order $n$ and of type $(u_1, \ldots, u_s)$ where
$\sum u_i = n - 1$ if and only if there exists an orthogonal design in order $n$ and of
type $(1, u_1, \ldots, u_s)$.

We couple this theorem with Theorem 6 and obtain the following result: 

Theorem 10 If $n \equiv 4 \pmod 8$ and if there exists an orthogonal design in
order $n$ and of type:

(i) $(a, b, c)$ where $a + b + c = n - 1$, then $abc$ is a square and, at every
prime $p$, $s_p(\langle a, b, c\rangle) = 1$;

(ii) $(a, b)$ where $a + b = n - 1$, then at every prime $p$, $(a, b)_p = 1$.
Similarly, combining Theorems 7 and 9, we get

Theorem 11 If $n \equiv 8 \pmod{16}$ and if there exists an orthogonal design in
order $n$ and of type:

(i) $(a_1, \ldots, a_7)$ where $\sum a_i = n - 1$, then $\prod a_i$ is a square and, at
every prime $p$, $s_p(\langle a_1, \ldots, a_7\rangle) = 1$;

(ii) $(a_1, \ldots, a_6)$ where $\sum a_i = n - 1$, then, at every prime $p$,
$(-1, \prod a_i)_p \, s_p(\langle a_1, \ldots, a_6\rangle) = 1$;

(iii) $(a_1, \ldots, a_5)$ where $\prod_{i=1}^{4} a_i$ is a square, $\sum a_i = n - 1$, and, at every
prime $p$, $s_p(\langle a_1, \ldots, a_4\rangle) = 1$, then $a_5$ is a sum of three squares;

(iv) $(a_1, \ldots, a_5)$ where $\prod_{i=1}^{3} a_i$ is a square, $\sum a_i = n - 1$, and, at every
prime $p$, $s_p(\langle a_1, a_2, a_3\rangle) = 1$, then $a_4 a_5$ is a sum of squares.

Examples of tuples which are eliminated by Theorems 10 and 11 as types 
of orthogonal designs are easy to find. For example, in order 20, the tuple 
(1, 2, 16) certainly satisfies condition (ii) in Theorem 6, but does not meet 
the condition (i) of Theorem 10. Similarly, the tuple (1, 1,1,1,1,1,17) is not 
the type of an orthogonal design in order 24 since it fails to satisfy condition 
(i) of Theorem 11. 


Appendix I 


In this appendix we wish to update the status of known orthogonal
designs in order 20. We use as a base the list given by Geramita and Verner [4].

The following 4-tuples are easily eliminated using Theorem 6 (i): (1,1,2, 
16), (1, 1, 5, 8), (1, 1, 5, 13), (1, 1, 8, 10), (1, 2, 6,11), (1,2,2,8), and (2,3,7,8). 

We apply (ii) of the same theorem to the 3-tuples in doubt and find that 
the following tuples fail at the indicated prime: 


at p = 3:    (1, 3, 9), (1, 3, 10), (1, 3, 16), (1, 4, 6), (1, 6, 13),
             (2, 5, 6), (2, 6, 11), (3, 4, 10), (3, 7, 10)

at p = 11:   (1, 3, 11), (3, 4, 11)

at p = 5:    (1, 2, 10), (1, 5, 13), (1, 8, 10), (5, 6, 7)

at p = 2:    (1, 5, 8), (2, 3, 8), (2, 7, 8)

at p = 13:   (2, 3, 13)

Similarly, the tuples (1, 2, 16), (3, 16), and (6, 13) are eliminated by
Theorem 10.

J. Wallis reports in private communication that designs of type (3, 3, 6, 6),
(1, 3, 14), (3, 6, 8), (2, 5, 7), and (7, 10) have been found in order 20. Using
Theorem 13 of Geramita et al. [2], we have found designs of the following
types where we indicate the first rows of the circulant matrices used:

type (1, 4, 5, 5):

    a d 0 0 -d     -d c 0 0 c     c d b -b d     0 c b b -c

type (1, 2, 2, 9):

    a b 0 0 -b     b b b 0 0     c d b -b 0     d -c 0 b -b

type (1, 2, 8, 9):

    a b c -c -b     b b b c -c     c d b -b c     d -c -c b -b

type (1, 4, 13):

    a c -c c -c     0 c -c -c -c     0 c b b -c     b c c c -b
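
The first rows listed above can be checked mechanically. The following is a minimal sketch, assuming that the construction in Theorem 13 of [2] requires the four circulant matrices to have periodic autocorrelations that sum to zero at every nonzero shift (as in a Goethals-Seidel type array); that requirement is an assumption here and is not stated in the text. The function names are hypothetical, and the variables are replaced by arbitrary integers.

```python
def periodic_autocorrelation(row, shift):
    """Sum of row[i] * row[i + shift], indices taken cyclically."""
    m = len(row)
    return sum(row[i] * row[(i + shift) % m] for i in range(m))


def autocorrelations_cancel(rows):
    """True if the autocorrelations of all rows sum to 0 at each nonzero shift."""
    m = len(rows[0])
    return all(
        sum(periodic_autocorrelation(r, s) for r in rows) == 0
        for s in range(1, m)
    )


def weight(rows):
    """Sum of squares of all entries (the diagonal entry of sum A_i A_i^T)."""
    return sum(x * x for r in rows for x in r)


def first_rows(a, b, c, d):
    """First rows of the circulants, keyed by the type of the design."""
    return {
        (1, 4, 5, 5): [[a, d, 0, 0, -d], [-d, c, 0, 0, c],
                       [c, d, b, -b, d], [0, c, b, b, -c]],
        (1, 2, 2, 9): [[a, b, 0, 0, -b], [b, b, b, 0, 0],
                       [c, d, b, -b, 0], [d, -c, 0, b, -b]],
        (1, 2, 8, 9): [[a, b, c, -c, -b], [b, b, b, c, -c],
                       [c, d, b, -b, c], [d, -c, -c, b, -b]],
        (1, 4, 13): [[a, c, -c, c, -c], [0, c, -c, -c, -c],
                     [0, c, b, b, -c], [b, c, c, c, -b]],
    }
```

Evaluating at a few integer points is only a spot check, not a symbolic proof; the weight function recovers the type as the coefficients of the squares of the variables.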

At present, it is still unknown whether the following tuples are types of 
designs in order 20: 

(1, 3, 6, 8) (5, 5, 9) 

(1, 4, 4, 9) (3, 7, 8) 

(1, 5, 5, 9) 

(2, 2, 5, 5) 

Appendix II 

We should like to disclose the early status of orthogonal designs on four 
variables in order 28. Using Theorems 6 and 9 we can eliminate all 4-tuples 
except the 73 listed below (this list has been verified by an APL search on a 
Burroughs 6700 computer as programmed by J. Verner): 


(1, 1, 1, 1) ✓       (1, 3, 6, 18)        (2, 4, 8, 9)
(1, 1, 1, 4) ✓       (1, 4, 4, 4) ✓       (2, 5, 5, 8) ✓
(1, 1, 1, 9) ✓       (1, 4, 4, 9) ✓       (2, 8, 8, 8)
(1, 1, 1, 16) ✓      (1, 4, 4, 16)        (2, 8, 9, 9)
(1, 1, 1, 25) ✓      (1, 4, 5, 5) ✓       (3, 3, 3, 3) ✓
(1, 1, 2, 2) ✓       (1, 4, 8, 8)         (3, 3, 3, 12)
(1, 1, 2, 8) ✓       (1, 4, 9, 9)         (3, 3, 6, 6)
(1, 1, 2, 18)        (1, 4, 10, 10)       (3, 4, 6, 8)
(1, 1, 4, 4) ✓       (1, 5, 5, 9) ✓       (3, 6, 8, 9)
(1, 1, 4, 9) ✓       (1, 8, 8, 9)         (4, 4, 4, 4) ✓
(1, 1, 4, 16)        (2, 2, 2, 2) ✓       (4, 4, 4, 9)
(1, 1, 5, 5) ✓       (2, 2, 2, 8) ✓       (4, 4, 4, 16)
(1, 1, 8, 8)         (2, 2, 2, 18)        (4, 4, 5, 5) ✓
(1, 1, 9, 9) ✓       (2, 2, 4, 4) ✓       (4, 4, 8, 8)
(1, 1, 10, 10)       (2, 2, 4, 9)         (4, 4, 9, 9) ✓
(1, 1, 13, 13)       (2, 2, 4, 16)        (4, 4, 10, 10)
(1, 2, 2, 4) ✓       (2, 2, 5, 5) ✓       (4, 5, 5, 9)
(1, 2, 2, 9) ✓       (2, 2, 8, 8) ✓       (5, 5, 5, 5) ✓
(1, 2, 2, 16)        (2, 2, 9, 9)         (5, 5, 8, 8)
(1, 2, 3, 6) ✓       (2, 2, 10, 10)       (5, 5, 9, 9)
(1, 2, 4, 8) ✓       (2, 3, 4, 6)         (6, 6, 6, 6)
(1, 2, 4, 18)        (2, 3, 6, 9)         (7, 7, 7, 7) ✓
(1, 2, 6, 12)        (2, 4, 4, 8) ✓
(1, 2, 8, 9) ✓       (2, 4, 4, 18)
(1, 3, 6, 8)         (2, 4, 6, 12)


The ✓ indicates those tuples that are the types of orthogonal designs
constructed by:

(i) direct sum of designs in orders 16 and 12; 

(ii) designs in order 14 and the amicable pair of types (1, 1); (1, 1) in 
order 2; see construction 22 of [2]; 

(iii) Baumert-Hall array of type (7, 7,7,7); see Wallis et al. [10, p. 261]; 

(iv) construction 16 of [2]; 

(v) Golay sequences; see [3a]. 


Appendix III 


Shapiro’s result in Theorem 8 is the first algebraic restriction on the type 
numbers of orthogonal designs in order 16. Below we list all 25 possible 
types of designs on 9 variables in order 16 not eliminated by Theorems 8 and 
9. 


(1, 1, 1, 1, 1, 1, 1, 1, 1) ✓
(1, 1, 1, 1, 1, 1, 1, 1, 2)
(1, 1, 1, 1, 1, 1, 1, 1, 3)
(1, 1, 1, 1, 1, 1, 1, 1, 4)
(1, 1, 1, 1, 1, 1, 1, 1, 5)
(1, 1, 1, 1, 1, 1, 1, 1, 6)
(1, 1, 1, 1, 1, 1, 1, 1, 8)
(1, 1, 1, 1, 1, 1, 1, 2, 2)
(1, 1, 1, 1, 1, 1, 1, 2, 4)
(1, 1, 1, 1, 1, 1, 1, 3, 4)
(1, 1, 1, 1, 1, 1, 1, 3, 6)
(1, 1, 1, 1, 1, 1, 1, 4, 5)
(1, 1, 1, 1, 1, 1, 2, 2, 2)
(1, 1, 1, 1, 1, 1, 2, 2, 3)
(1, 1, 1, 1, 1, 1, 2, 2, 4)
(1, 1, 1, 1, 1, 1, 2, 2, 6)
(1, 1, 1, 1, 1, 1, 2, 3, 5)
(1, 1, 1, 1, 1, 1, 2, 4, 4)
(1, 1, 1, 1, 1, 2, 2, 2, 2)
(1, 1, 1, 1, 1, 2, 2, 3, 4)
(1, 1, 1, 1, 1, 2, 3, 3, 3)
(1, 1, 1, 1, 2, 2, 2, 2, 2)
(1, 1, 1, 1, 2, 2, 2, 2, 4)
(1, 1, 2, 2, 2, 2, 2, 2, 2) ✓


✓ indicates that a design of that type exists in order 16 (see [2]).

Combining Theorems 8 and 9, one may also eliminate the following
8-tuples as types of orthogonal designs in order 16.


(1, 1, 1, 1, 1, 3, 3, 4)        (1, 1, 1, 2, 2, 2, 3, 3)
(1, 1, 1, 1, 2, 2, 2, 5)        (1, 1, 2, 2, 2, 2, 2, 3)


REFERENCES

[1] A. V. Geramita and N. J. Pullman, A theorem of Hurwitz and Radon and orthogonal
projective modules, Proc. Amer. Math. Soc. 42 (1974), 51-56.

[2] A. V. Geramita, J. M. Geramita, and J. S. Wallis, Orthogonal designs, to appear, J. Linear
and Multilinear Alg.

[3] A. V. Geramita and J. S. Wallis, Orthogonal designs IV. Existence questions, J. Combinatorial
Theory Ser. A 19 (1975), 66-83.

[3a] A. V. Geramita and J. S. Wallis, Orthogonal designs II, to appear, Aequationes Math.

[4] A. V. Geramita and J. H. Verner, Orthogonal designs with zero diagonal,
Canad. J. Math. 28 (1976), 215-224.

[5] M. Plotkin, Decomposition of Hadamard matrices, J. Combinatorial Theory Ser. A 12
(1972), 127-130.

[6] J. Radon, Lineare Scharen orthogonaler Matrizen, Abh. Math. Sem. Univ. Hamburg 1
(1922), 1-14.

[7] J. P. Serre, “A Course in Arithmetic,” Springer-Verlag, New York, 1973.

[8] D. B. Shapiro, Similarities, quadratic forms, and Clifford algebras, Ph.D. Thesis, Univ.
California, Berkeley, 1974.

[9] O. Taussky, (1, 2, 4, 8)-sums of squares and Hadamard matrices, “Combinatorics,” Proc.
Symp. in Pure Math., Amer. Math. Society, Providence, Rhode Island, 1971, pp. 229-233.

[10] W. D. Wallis, A. P. Street, and J. S. Wallis, “Combinatorics: Room Squares, Sum-Free
Sets, Hadamard Matrices,” Lecture Notes, Vol. 292, Springer-Verlag, Berlin-Heidelberg-New York,
1972.


AMS (MOS) 1970 subject classifications: 05B30, 15A66. 



On Finite Projective Planes 
of Lenz-Barlotti Class 13 
COLUMBUS, OHIO


We study the existence of finite planes of Lenz-Barlotti class 13, in particular for orders $n \le 250$.
By extending a result of W. M. Kantor, we are able to exclude certain values of $n$ that were not
hitherto rejected. We show that $n \ne 9$ by direct calculation, and list all $n \le 250$ for which the
existence question is undecided. We also consider finite planes of class 13 whose additive loop has
both inverse properties; in this case further orders can be excluded, in particular $n = 25$ and those $n$
with $n - 1 = 3^r$, $2(3^r)$, $3p$ or $6p$, where $p$ is a prime $> 3$.

While finite planes of Lenz-Barlotti class 14 have been studied extensively,
not much is known about finite planes of the related class 13, even for
small values of the order. (Planes of both types have recently been surveyed
by Kantor [6, 7]. There is no known finite example of either class.) The
purpose of the present paper is to give a slight extension of a theorem of
Kantor [6, Theorem 6.3], and to investigate the existence of planes of class 13
for specific values of the order $n$, where $n \le 250$. Basic information concerning
projective planes can be found in Dembowski [2], and Pickert [9]; we use
the coordinate system of [9]. The principal results on finite planes of class 13
are summarized in Theorems A and B.

Theorem A Let $\pi$ be a plane of class 13 with finite order $n$. Let the
coordinate quadrangle $VUOE$ be chosen so that $\pi$ is $(V, OU)$ and $(U, OV)$
transitive, and let $R$ be the corresponding ternary ring. Then:

(i) $R$ is linear, multiplication is associative, and $R$ satisfies the distributive
law $a(b + c) = ab + ac$. The multiplicative group $R^*$ is not commutative,
the additive loop $(R, +)$ is not associative, and $R$ does not satisfy the distributive
law $(b + c)a = ba + ca$.

(ii) $R^*$ contains at most one involution.

(iii) The Sylow 2-groups of $R^*$ are cyclic or generalized quaternion.

(iv) If $n$ is even and if $p$ is a prime that divides $n - 1$, then there exist
integers $x, y, z$, not all zero, such that $nx^2 + (-1)^{(p-1)/2}py^2 = z^2$.

(v) Let $G = R^* \times R^*$, let $D = \{(a, b) \in G \mid b = a + 1\}$, $G_1 = \{1\} \times R^*$,
$G_2 = R^* \times \{1\}$, and $G_3 = \{(r, r) \mid r \in R^*\}$. Then $|D| = n - 2$ and

$$\Bigl(\sum_{d \in D} d\Bigr)\Bigl(\sum_{d \in D} d^{-1}\Bigr) = n + \sum_{g \in G} g - \sum_{g \in G_1} g - \sum_{g \in G_2} g - \sum_{g \in G_3} g,$$

where the sums are taken in the rational group algebra $QG$ of $G$.

(vi) If $R^*$ admits a noncyclic abelian 2-group as a homomorphic image,
then $n$ is a square.

(vii) $n - 1$ is not a power of 2 if $n \ne 9$.

Proofs of the statements in (i) can be found in [9, §3.5]. Statement (ii)
follows from the Ostrom-Luneburg theorem [2, p. 120], and (iii) follows
from (ii) (Burnside [1, p. 132]). Statements (iv) and (v) are proved by Hughes
([5, Theorem 3.2]; [4, §III.1] respectively), and (vi), (vii) by Kantor [6,
Theorem 6.3 and Corollary 6.4]. Note that only one distributive law is
necessary for the proof of (v).

If $a \in (R, +)$, let $a + (-a) = 0$. Then $(R, +)$ is said to have the right
(left) inverse property if and only if, for all $a, b \in (R, +)$, $(b + a) + (-a) = b$
($a + (-a + b) = b$, respectively).

Theorem B Let $\pi$ be a plane of class 13 with finite order $n$, coordinatized
as in Theorem A. Suppose that $(R, +)$ has both inverse properties. Then:

(i) $(-1) + 1 = 0$, $(-1)^2 = 1$, and $a(-1) = (-1)a = -a$ for all $a \in R$.

(ii) $(R, +)$ is commutative.

(iii) If $a^2 = 1$, then $a = 1$ or $-1$.

(iv) Let $a \ne 1$. Then $a + 1 = (-1)a^2 \iff a^3 = 1$.

(v) If $a^3 = b^3 = 1$, if $a, b \ne 1$, and if $ab = ba$, then $b = a$ or $a^{-1}$.

(vi) The Sylow 3-groups of $R^*$ are cyclic.

(vii) $n - 1 \ne 3^r$ or $2(3^r)$.

(viii) $n$ is even $\iff 1 + 1 = 0$.





(ix) If 3 divides $n$, then $1 + 1 = -1$; if $1 + 1 = -1$, then $n \equiv 1$ or
$3 \pmod 6$.

(x) $n \not\equiv 0 \pmod 6$.

(xi) $n \not\equiv 2 \pmod 4$.

(xii) $n \not\equiv 15$ or $21 \pmod{24}$.

To prove (i), observe that $1 = 1 + (-1 + 1)$ by the left inverse property,
whence $-1 + 1 = 0$. Then $(-1)^2 + (-1) = 0$, by the distributive law, so
that $(-1)^2 = 1$. Thus, by Theorem A(ii), $a(-1) = (-1)a$ for all $a \in R$, and
$a(-1) = -a$ by the distributive law. (This slight change in the proof of [6,
Theorem 2.7(i)] is required because $R$ now has only one distributive law.)
Statements (ii)-(v) are proved in Kantor [6, Theorem 2.7, (ii)-(v)]. Note that
the proof of Theorem 2.7(v) uses commutativity only in the subgroup $\langle a, b\rangle$,
and hence proves the present statement (v). (The statement of Theorem
2.7(v), omitted in Kantor [6], can be found in Kantor [7].) Now (vi) follows
from (v) (Burnside [1, p. 131]), and (vii) follows from (vi) because $R^*$ is a
nonabelian group with at most one involution. Since $|R^*| = n - 1$, (i) and
(iii) imply (viii). (See also Hughes [4, p. 513].) Statements (ix), (x) are proved
by Hughes [4, Theorem II.11], and (xi) follows from [6, Lemma 4.9] and a
theorem of Hughes [2, p. 173, 12]. Finally, (xii) is proved in [7].

If a finite $(V, OU)$, $(U, OV)$ transitive plane $\pi$ is of class at least 13 and if $R^*$ is
noncommutative, then $\pi$ is either a “near-field plane” (of class IVa2 or IVa3),
or a “dual near-field plane” (of class IVb2 or IVb3). In the former case $\pi$ is
$(VU, VU)$ transitive and $(R, +)$ is associative. In the latter case $\pi$ is $(O, O)$
transitive, so that the coordinatization is not the standard one; here $(R, +)$
has the right inverse property, but need not (and probably cannot) have the
left inverse property.

Theorem A(vi) is proved by applying a certain homomorphism to the
identity in Theorem A(v). It is suggested in Kantor [6] that some similar
argument might be used to show that $R^*$ cannot admit $C_p \times C_p$ as a homomorphic
image, where $C_p$ denotes a cyclic group of odd prime order $p$. This
would be a major advance in the theory. Unfortunately the natural analogous
procedure does not yield a result of such generality. However, we show
in Theorem 1 and its corollary that the approach does furnish a weak
numerical restriction, and that this has some modest application in excluding
non-prime-power orders for class 13.

Theorem 1 Let $\pi$ be a plane of class at least 13 and of finite order $n$.
Suppose that $R^*$ admits $C_p \times C_p$ as a homomorphic image, where $C_p$ is a group
of odd prime order $p$. Then the Legendre symbol $(q/p) = 1$ for each odd prime $q$
that divides the square-free part of $n$. Equivalently, there exist integers $x, y, z$,
not all zero, such that

$$nx^2 + (-1)^{(p-1)/2}py^2 = z^2.$$
Proof Let $\pi$ be of class at least 13 and of finite order $n$, coordinatized as
in Theorem A. Let $G = R^* \times R^*$, $D = \{(a, b) \in G \mid b = a + 1\}$,
$G_1 = \{1\} \times R^*$, $G_2 = R^* \times \{1\}$, and $G_3 = \{(r, r) \mid r \in R^*\}$. Let
$H = \langle\alpha\rangle \times \langle\beta\rangle$, where $\alpha^p = \beta^p = 1$ and $\alpha, \beta \ne 1$. Suppose that there exists an
epimorphism $\varphi: R^* \to H$, with kernel $K$. Let $A$, $B$ be the complete inverse images in
$R^*$ of $\langle\beta\rangle$, $\langle\alpha\rangle$ respectively. Then there exist epimorphisms $\varphi_1: R^* \to \langle\alpha\rangle$
and $\varphi_2: R^* \to \langle\beta\rangle$, with kernels $A$ and $B$, respectively. Define $\psi: G \to H$ by
$\psi(r, s) = \varphi_1(r)\varphi_2(s)$ for each $(r, s) \in G$. Then $\psi$ is an epimorphism with
kernel $A \times B$, and $\psi$ can be extended to an epimorphism $\Psi: QG \to QH$,
where $QG$, $QH$ are the rational group algebras of $G$ and $H$.

Let $a_{ij}$ be the number of $d \in D$ such that $\psi(d) = \alpha^i\beta^j$, for $i, j = 0, 1, \ldots,
p - 1$; then $a_{ij}$ is also the number of $d \in D$ such that $\psi(d^{-1}) = \alpha^{-i}\beta^{-j}$. The
restrictions of $\psi$ to $G_1$, $G_2$ clearly have kernels of order $(n - 1)/p$, while the
restriction of $\psi$ to $G_3$ has kernel of order $(n - 1)/p^2$, because $A \cap B = K$.
Hence, on applying $\Psi$ to the identity in Theorem A(v), we find

$$\Bigl(\sum_{i,j=0}^{p-1} a_{ij}\alpha^i\beta^j\Bigr)\Bigl(\sum_{k,m=0}^{p-1} a_{km}\alpha^{-k}\beta^{-m}\Bigr) = n + (n-1)^2p^{-2}\sum_{i,j=0}^{p-1}\alpha^i\beta^j - (n-1)p^{-1}\sum_{i=0}^{p-1}\alpha^i - (n-1)p^{-1}\sum_{j=0}^{p-1}\beta^j - (n-1)p^{-2}\sum_{i,j=0}^{p-1}\alpha^i\beta^j. \tag{1}$$

It follows that

$$\Bigl(\sum_{i,j=0}^{p-1} a_{ij}\alpha^i\beta^j\Bigr)\Bigl(\sum_{k,m=0}^{p-1} a_{km}\alpha^{-k}\beta^{-m}\Bigr) = \sum_{r,s=0}^{p-1}\mu(r,s)\alpha^r\beta^s, \tag{2}$$

where

$$\begin{aligned}
\mu(0, 0) &= n + (n-1)^2p^{-2} - 2(n-1)p^{-1} - (n-1)p^{-2},\\
\mu(0, t) = \mu(t, 0) &= (n-1)^2p^{-2} - (n-1)p^{-1} - (n-1)p^{-2} \quad \text{if } t \ne 0,\\
\mu(r, s) &= (n-1)^2p^{-2} - (n-1)p^{-2} \quad \text{if } rs \ne 0.
\end{aligned} \tag{3}$$

Equating coefficients for the term $\alpha^r\beta^s$ in (2), we have

$$\mu(r, s) = \sum_{k,m=0}^{p-1} a_{k+r,\,m+s}\,a_{km} \quad \text{for } r, s = 0, 1, \ldots, p-1, \tag{4}$$

where subscripts in (4) and hereafter are taken mod $p$.

Let $\nu(r, s) = \sum_{j=0}^{p-1}\mu(r+j, s+j)$, for $r, s = 0, 1, \ldots, p - 1$, and let
$\gamma_t = \sum_{i=0}^{p-1} a_{i,\,i+t}$ for $t = 0, 1, \ldots, p - 1$. Then, by (4),

$$\nu(r, s) = \sum_{k,m=0}^{p-1}\Bigl(\sum_{j=0}^{p-1} a_{k+r+j,\,s+m+j}\Bigr)a_{km} = \sum_{k,m=0}^{p-1}\gamma_{s-r+m-k}\,a_{km} = \sum_{k,t=0}^{p-1}\gamma_{s-r+t}\,a_{k,\,k+t}.$$

Hence

$$\nu(r, s) = \sum_{t=0}^{p-1}\gamma_{s-r+t}\,\gamma_t. \tag{5}$$

Let $M$ be the $p \times p$ symmetric circulant matrix with $M_{ij} = \gamma_{i+j-2}$, and let $N$
be the $p \times p$ matrix with $N_{ij} = \nu(i, j)$, for $i, j = 0, 1, \ldots, p - 1$. Then, by (5),
MM t = N. (6) 

On calculating the $\nu(i, j)$ from (3), we find $N = nI + (n-1)(n-4)p^{-1}J$,
where $I$ is the identity matrix and $J$ is the matrix each of whose entries is one.
Hence by Hughes [5, Lemma 3.1] $N$ is positive definite, $\det N$ is the square
of an integer, and for each odd prime $q$, the Hasse invariant
$C_q(N) = (n, -1)_q^{(p-1)/2}(n, p)_q$, where $(a, b)_q$ denotes the Hilbert norm-residue symbol.
It follows from the properties of this symbol (see, e.g., [2, p. 18]), and from the
quadratic reciprocity theorem that $C_q(N) = (q/p)$ if $q$ divides the square-free
part of $n$ and that $C_q(N) = 1$ otherwise. Now, by (6), the matrix $N$ is rationally
congruent to the identity matrix. By the Hasse-Minkowski theorem
(Hasse [3]) this is possible only if $C_q(N) = 1$ for each odd prime $q$. Hence
$(q/p) = 1$ for each odd prime $q$ that divides the square-free part of $n$. Equivalently
(cf. Hughes [5, Theorem 3.2]), the equation $nx^2 + (-1)^{(p-1)/2}py^2 = z^2$
has a nonzero solution in integers.
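
The closed form for $N$ used in the proof can be confirmed numerically. The sketch below is an illustration only, with hypothetical function names; it evaluates the $\mu(r, s)$ of (3) in exact rational arithmetic, sums them along diagonals to obtain $\nu(r, s)$, and compares the result with $nI + (n-1)(n-4)p^{-1}J$.

```python
from fractions import Fraction


def mu(n, p, r, s):
    """The coefficient mu(r, s) of (3); indices are taken mod p."""
    r %= p
    s %= p
    common = Fraction((n - 1) ** 2 - (n - 1), p * p)
    if r == 0 and s == 0:
        return n + common - 2 * Fraction(n - 1, p)
    if r == 0 or s == 0:
        return common - Fraction(n - 1, p)
    return common


def nu(n, p, r, s):
    """nu(r, s) = sum over j of mu(r + j, s + j)."""
    return sum(mu(n, p, r + j, s + j) for j in range(p))


def matrix_from_nu(n, p):
    """The matrix N with entries nu(i, j)."""
    return [[nu(n, p, i, j) for j in range(p)] for i in range(p)]


def closed_form(n, p):
    """The matrix n*I + (n-1)(n-4)/p * J."""
    off = Fraction((n - 1) * (n - 4), p)
    return [[off + (n if i == j else 0) for j in range(p)] for i in range(p)]
```

For instance, with $n = 28$ and $p = 3$ the diagonal entries are $28 + 216 = 244$ and the off-diagonal entries are $216$.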

If $n$ and $p$ satisfy the conclusion of Theorem 1, the Minkowski-Hasse
theorem ensures that there exists a rational matrix $M$ that satisfies (6). One
may still ask whether Eq. (1) has a solution in nonnegative integers. For
$n = 28$ and $p = 3$ (which is the smallest case of interest in the context of
Theorem 2 below), Eq. (1) has the solution $a_{00} = 2$, $a_{01} = 1$, $a_{02} = 5$,
$a_{10} = 3$, $a_{11} = 4$, $a_{12} = 2$, $a_{20} = 3$, $a_{21} = 4$, $a_{22} = 2$.
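
The solution for $n = 28$, $p = 3$ can be tested directly against (3) and (4). The following sketch uses hypothetical function names and takes $a_{10} = 3$, $a_{11} = 4$, which is the assignment for which (3) and (4) hold; with $a_{10} = 4$ the entries would sum to 27 rather than $|D| = n - 2 = 26$.

```python
from fractions import Fraction

# a[i][j] is the number of d in D with psi(d) = alpha^i beta^j, for n = 28, p = 3.
A_28_3 = [
    [2, 1, 5],
    [3, 4, 2],
    [3, 4, 2],
]


def mu(n, p, r, s):
    """Right-hand coefficients mu(r, s) from (3); indices taken mod p."""
    r %= p
    s %= p
    common = Fraction((n - 1) ** 2 - (n - 1), p * p)
    if r == 0 and s == 0:
        return n + common - 2 * Fraction(n - 1, p)
    if r == 0 or s == 0:
        return common - Fraction(n - 1, p)
    return common


def correlation(a, r, s):
    """Left-hand side of (4): sum over k, m of a[k+r][m+s] * a[k][m], indices mod p."""
    p = len(a)
    return sum(
        a[(k + r) % p][(m + s) % p] * a[k][m]
        for k in range(p)
        for m in range(p)
    )


def satisfies_equation_4(a, n):
    """True if (4) holds for every pair (r, s)."""
    p = len(a)
    return all(
        correlation(a, r, s) == mu(n, p, r, s)
        for r in range(p)
        for s in range(p)
    )
```

This only shows that the numerical system (4) is solvable in nonnegative integers for this pair; it says nothing about whether a plane exists.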

The conclusion of Theorem 1 is always satisfied if $n$ is a prime power and
$p$ divides $n - 1$. When $n$ is even, Theorem 1 gives less information than
Theorem A(iv), and when $n \equiv 1$ or $2 \pmod 4$, the values of $n$ to which
Theorem 1 can be applied may well be excluded by the Bruck-Ryser
theorem [2, p. 144, 13]. However, Theorem 1 does imply the nonexistence of
planes of class 13 with order $n$ for infinitely many $n \equiv 3 \pmod 4$ that are not
otherwise excluded.


Corollary (a) There is no plane of class 13 with order $n = 1 + 2p^r$,
where $p$ is an odd prime, if the square-free part of $n$ is divisible by a prime $q$ such
that $(q/p) = -1$.

(b) There is no plane of class 13 with order $n = 1 + 2p^r$ if $p$ is a prime $\equiv
7 \pmod{12}$ and if $r(p - 1)/3 \not\equiv 1 \pmod 3$.

Proof (a) If $\pi$ is of class 13 with order $n = 1 + 2p^r$, then $|R^*| = 2p^r$.
By Theorem A(ii), $R^* = \langle\sigma\rangle \times T$, where $\sigma$ is the unique involution in $R^*$
and $T$ is the Sylow $p$-group of $R^*$. Since $R^*$ is nonabelian, $T$ is nonabelian,
whence $r \ge 3$. It follows that $T$, hence also $R^*$, admits $C_p \times C_p$ as a homomorphic
image (as is easily shown by induction on $r$). Now apply Theorem 1.

(b) Suppose that $p \equiv 7 \pmod{12}$ and that $r(p - 1)/3 \not\equiv 1 \pmod 3$.
Then 3 divides $n$, but $3^2$ does not because $1 + 2p^r = 1 +
2[1 + (p - 1)]^r \equiv 3 + 2r(p - 1) \pmod{3^2}$. Since $p \equiv 3 \pmod 4$, the quadratic
reciprocity theorem ensures that $(3/p) = -(p/3) = -1$.
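
Part (a) of the corollary is easy to apply by machine. The sketch below is an illustration with hypothetical function names: it factors $n - 1$ as $2p^r$, finds the primes that divide the square-free part of $n$, and evaluates the Legendre symbol by Euler's criterion.

```python
def factorize(m):
    """Prime factorization of m >= 1 by trial division, as {prime: exponent}."""
    factors = {}
    d = 2
    while d * d <= m:
        while m % d == 0:
            factors[d] = factors.get(d, 0) + 1
            m //= d
        d += 1
    if m > 1:
        factors[m] = factors.get(m, 0) + 1
    return factors


def legendre(q, p):
    """Legendre symbol (q/p) for an odd prime p, via Euler's criterion."""
    t = pow(q % p, (p - 1) // 2, p)
    return -1 if t == p - 1 else t


def excluded_by_corollary_a(n):
    """True if n = 1 + 2*p^r with p an odd prime and some prime q dividing
    the square-free part of n has (q/p) = -1."""
    if n < 3 or (n - 1) % 2 != 0:
        return False
    half = factorize((n - 1) // 2)
    if len(half) != 1:
        return False
    p = next(iter(half))
    if p == 2:
        return False
    return any(
        exponent % 2 == 1 and legendre(q, p) == -1
        for q, exponent in factorize(n).items()
    )
```

For $n = 55 = 1 + 2 \cdot 3^3$ the square-free part is $5 \cdot 11$ and $(5/3) = -1$, so $n = 55$ is excluded; for $n = 687 = 1 + 2 \cdot 7^3 = 3 \cdot 229$ one has $(3/7) = -1$, in agreement with part (b).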

As was noted some time ago in Yaqub [11], it is not hard to show that
most non-prime-powers < 100 cannot be the order of a plane of class 13. In
Theorem 2 we list all orders $n \le 250$ that cannot yet be excluded. Non-prime-powers
are in bold-faced type, and values $n$ for which there exists a
(genuine) near-field of order $n$ carry an asterisk; the remaining values are
primes, except for $125 = 5^3$, for which no near-field exists. (See, e.g., [2, pp.
228-231].)

Theorem 2 If there exists a plane of class 13 with order $n \le 250$, then
n = 13, 25*, 28, 29, 37, 41, 43, 45, 49*, 53, 61, 64*, 73, 76, 79, 81*, 85, 89, 97,
101, 109, 111, 112, 113, 115, 117, 121*, 125, 127, 137, 145, 148, 149, 151, 153,
163, 169*, 172, 173, 185, 187, 193, 196, 205, 211, 221, 223, 225, 229, 233, 235,
241, 244, or 245.

Proof The values n = 40, 56, 58, 82, 130, 156, 184, 202, 204, 220, 226, 
232, 248 are excluded by Theorem A(iv), the values n = 17, 65 by Theorem 
A(vi), and the value n = 55 by the corollary to Theorem 1. All other values 
except n = 9 are excluded either by the Bruck-Ryser theorem or by the fact 
that every group of order n — 1 that contains at most one involution is 
abelian. 
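
One of the filters used in this proof, the Bruck-Ryser theorem, is simple to reproduce. The sketch below assumes the usual statement (if $n \equiv 1$ or $2 \pmod 4$ is the order of a projective plane, then $n$ is a sum of two integer squares); the function names are hypothetical, and the list is copied from Theorem 2.

```python
from math import isqrt


def is_sum_of_two_squares(n):
    """True if n = x^2 + y^2 for some integers x, y >= 0."""
    for x in range(isqrt(n) + 1):
        rest = n - x * x
        if isqrt(rest) ** 2 == rest:
            return True
    return False


def passes_bruck_ryser(n):
    """True unless the Bruck-Ryser theorem rules out a plane of order n."""
    return n % 4 not in (1, 2) or is_sum_of_two_squares(n)


THEOREM_2_ORDERS = [
    13, 25, 28, 29, 37, 41, 43, 45, 49, 53, 61, 64, 73, 76, 79, 81, 85, 89, 97,
    101, 109, 111, 112, 113, 115, 117, 121, 125, 127, 137, 145, 148, 149, 151,
    153, 163, 169, 172, 173, 185, 187, 193, 196, 205, 211, 221, 223, 225, 229,
    233, 235, 241, 244, 245,
]
```

Every order that survives in Theorem 2 must pass this filter, which gives a quick consistency check on the list; orders such as 21 or 33 fail it.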

Suppose $n = 9$. Then, by Theorem A(iii), $R^*$ is the quaternion group
$\langle a, b \mid a^4 = b^4 = 1,\ a^2 = b^2,\ b^{-1}ab = a^{-1}\rangle$. Let $z$ denote the unique involution
in $R^*$. (Note that we cannot assume $z = -1$.) Then $R^* = \{1, z, a, b, za,
zb, ab, ba\}$. The group Aut $R^*$ is transitive on the elements of order 4, and the
subgroup that fixes $a$ is transitive on $R^* - \langle a\rangle$. We construct all nonisomorphic
loops $(R, +)$ that are compatible with the distributive law, by considering
the successive possibilities for $1 + x$ as $x$ runs through $R^*$. Whenever
possible, we use the existence of automorphisms of $R^*$ to reduce the number
of cases. Thus, for example, if $1 + 1$ is of order 4, we may assume without
loss of generality that $1 + 1 = a$. Then $1 + z \ne 1$ or $z$, and $1 + z \ne za$ because
$z + z = za$; hence we can assume that $1 + z = 0$ or $b$. In this way we
find only the following six loops:


(i) 1 + 1 = 0, 1 + z = a, 1 + a = b, 1 + b = z,
1 + za = ab, 1 + zb = za, 1 + ab = ba, 1 + ba = zb.

(ii) 1 + 1 = 0, 1 + z = a, 1 + a = b, 1 + b = zb,
1 + za = ba, 1 + zb = ab, 1 + ab = z, 1 + ba = za.

(iii) 1 + 1 = z, 1 + z = 0, 1 + a = b, 1 + b = a,
1 + za = ba, 1 + zb = ab, 1 + ab = zb, 1 + ba = za.

(iv) 1 + 1 = z, 1 + z = 0, 1 + a = b, 1 + b = ba,
1 + za = ab, 1 + zb = za, 1 + ab = zb, 1 + ba = a.

(v) 1 + 1 = z, 1 + z = 0, 1 + a = b, 1 + b = ab,
1 + za = ba, 1 + zb = za, 1 + ab = a, 1 + ba = zb.

(vi) 1 + 1 = z, 1 + z = 0, 1 + a = b, 1 + b = a,
1 + za = ab, 1 + zb = ba, 1 + ab = za, 1 + ba = zb.




The first four loops are nonplanar; this can easily be verified by finding
$r, x, y \in R$ with $r \ne 1$, $x \ne y$, and $(r + x)^{-1}(1 + x) = (r + y)^{-1}(1 + y)$. (See,
e.g., Yaqub [10, p. 377].) It follows that loops (v) and (vi) must correspond to
the near-field plane of order 9 and its dual (of class IVa3 and class IVb3,
respectively) since these planes must appear as solutions. Hence there is no
plane of class 13 with order 9.

Henceforth we shall assume that ( R , +) has both inverse properties. 
Then Theorem B applies, and in particular ( R , +) is commutative. The 
following lemma is essentially contained in Kantor [7], but we include the 
proof because cases (ii) and (iv) of part (b) do not arise there. (See also 
the proof of Hughes [4, Theorem II. 11].) 

Lemma Suppose that $\pi$ is finite, of class at least 13, and that $(R, +)$ has
both inverse properties. Suppose also that $a, b \in R^*$ and that $a + b = -1$. Let
$\Sigma$ be the set consisting of the distinct elements from among $a$, $a^{-1}$, $b$, $b^{-1}$, $a^{-1}b$,
$b^{-1}a$. Then:

(a) $a^{-1} + a^{-1}b = -1 = b^{-1} + b^{-1}a$.

(b) Either (i) $|\Sigma| = 6$, or (ii) $\Sigma = \{c, c^{-1}\}$ where $c \ne 1$, $c^3 = 1$, and
$c + c^{-1} = -1$, or (iii) $\Sigma = \{1\}$ and $1 + 1 = -1$, or (iv) $\Sigma = \{1, c, c^{-1}\}$ where
$c \ne 1, -1$ and $1 + c = -1 = c^{-1} + c^{-1}$.

(c) $R^* - \{-1\}$ can be partitioned into sets of type (i)-(iv). If $a$ is of order
3, then $a$ belongs to a set of type (ii). There is at most one set of type (iv).

(d) If $N$ is a subgroup of index 2 in $R^*$, and if $\Sigma$ is a member of the
partition such that $|\Sigma| = 6$, then $|N \cap \Sigma| = 2$ or 6.

Proof (a) If $a, b \in R^*$ and if $a + b = -1$, then $a \ne -1 \ne b$. By
Theorem B(i), $(-1)^2 = 1$ and $x(-1) = (-1)x$ for all $x \in R^*$. Thus $a + b =
-1 \Rightarrow a = -1 + (-1)b$, by the inverse property, $\Rightarrow -1 = a^{-1} + a^{-1}b$ by
the distributive law. Similarly, $b^{-1} + b^{-1}a = -1$.
(b) Suppose that $|\Sigma| < 6$. Then, without loss of generality, we may
assume that $a = a^{-1}$, $b$, $b^{-1}$, $a^{-1}b$, or $b^{-1}a$. If $a = a^{-1}$, then $a = 1$ since
$a \ne -1$; similarly, if $a = b^{-1}a$, then $b = 1$. Now if $a = b = 1$, then $\Sigma$ is of
type (iii), and if $a = 1 \ne b$ (or if $b = 1 \ne a$), then $\Sigma$ is of type (iv), with $c = b$
($c = a$, respectively). Again, if $a = b \ne 1$, then $\Sigma$ is of type (iv), with $c = a^{-1}$.
Suppose next that $a = b^{-1}$, with $a \ne 1$. Then $a + a^{-1} = -1 \Rightarrow a^2 = (-1)
(1 + a) \Rightarrow 1 + a = (-1)a^2$. Hence $a$ is of order 3, by Theorem B(iv), and $\Sigma$ is
of type (ii). Suppose finally that $a = a^{-1}b$ with $a \ne 1$. Then $a + a^2 = -1$, so
that again $1 + a = (-1)a^2$ and $\Sigma$ is of type (ii).

(c) Clearly, $R^* - \{-1\}$ can be partitioned into sets $\Sigma$ of type (i)-(iv). If
$a$ is of order 3, then $a + a^{-1} = -1$ by Theorem B(iv), whence $a$ belongs to a
set of type (ii). If $\Sigma$ is of type (iv), then $c = (-1)(1 + 1)$. Hence there is at
most one such set.

(d) Suppose $|\Sigma| = 6$ and that $N$ is of index 2 in $R^*$. If $a, b \in N$, then
$|N \cap \Sigma| = 6$. Otherwise, $|N \cap \Sigma| = 2$ because $N$ is of index 2.

We shall call the above sets of type (i)-(iv) “$\Sigma$-sets”; a $\Sigma$-set is “proper” if
and only if $|\Sigma| = 6$.

Theorem 3 Suppose that $\pi$ is of class 13 with order $n$, and that $(R, +)$ has
both inverse properties.

(i) If $1 + 1$ is of order 4, then $\pi$ contains a subplane of order 5.

(ii) If $1 + 1$ is of order 3, then $\pi$ contains a subplane of order 7.

(iii) $1 + 1$ is not of order 6.

(iv) If $n - 1 = 4p$, where $p$ is a prime $\ge 3$, then $1 + 1$ is of order 4.

(v) If $p$ is a prime $> 3$, then $n - 1 \ne 6p$.

(vi) If $n$ is even and if $R^* = \langle a, N\rangle$, where $a^3 = 1$ and $N$ is a normal
subgroup of index 3 in $R^*$, then $a$ cannot act fixed point free on $N$.

(vii) If $p$ is a prime $> 3$, then $n - 1 \ne 3p$.

(viii) If $p$ is an odd prime $\equiv -1 \pmod 3$, then $n - 1 \ne 3p^2$.

Proof (i) Suppose that $1 + 1 = a$, where $a$ has order 4. Then $n$ is odd,
and $a^2 = -1 \ne 1$ by Theorem B(viii). Using the inverse properties and the
commutativity of $(R, +)$, we find $1 + a = a^{-1}$ and $1 + a^{-1} = -1$. Hence the
set $\{0\} \cup \langle a\rangle$ is closed under addition and multiplication, and must therefore
coordinatize a subplane of order 5.

(ii) Suppose that $1 + 1 = b$, where $b$ has order 3. Then $n$ is odd and
$-1 \ne 1$. Let $a = (-1)b$, so that $a$ has order 6 and $b = a^4$. By Theorem B(iv)
$1 + a^4 = a^5$ and $1 + a^2 = a$. Also, since $1 + 1 = a^4$, we have $1 = a^4 -
1 = a^4 + a^3$, whence $1 + a = a^3$ and $1 + a^5 = a^2$. Hence $\{0\} \cup \langle a\rangle$ is closed
under addition and multiplication, and must coordinatize a subplane of
order 7.
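
The relations derived in (i) and (ii) are exactly those that hold in the prime fields of orders 5 and 7. The sketch below illustrates this with arithmetic mod 5 and mod 7; identifying the coordinate ring of the subplane with a prime field is an assumption made only for illustration, and the function names are hypothetical.

```python
def multiplicative_order(x, q):
    """Order of x in the multiplicative group of integers mod the prime q."""
    k, y = 1, x % q
    while y != 1:
        y = y * x % q
        k += 1
    return k


def relations_order_5():
    """Relations of part (i) with a = 1 + 1 in the field of order 5."""
    q = 5
    a = (1 + 1) % q
    a_inv = pow(a, q - 2, q)
    return {
        "a has order 4": multiplicative_order(a, q) == 4,
        "a^2 = -1": a * a % q == q - 1,
        "1 + a = a^-1": (1 + a) % q == a_inv,
        "1 + a^-1 = -1": (1 + a_inv) % q == q - 1,
    }


def relations_order_7():
    """Relations of part (ii) with b = 1 + 1 and a = (-1)b in the field of order 7."""
    q = 7
    b = (1 + 1) % q
    a = (-b) % q
    power = lambda k: pow(a, k, q)
    return {
        "b has order 3": multiplicative_order(b, q) == 3,
        "a has order 6": multiplicative_order(a, q) == 6,
        "b = a^4": b == power(4),
        "1 + a^4 = a^5": (1 + power(4)) % q == power(5),
        "1 + a^2 = a": (1 + power(2)) % q == a,
        "1 + a = a^3": (1 + a) % q == power(3),
        "1 + a^5 = a^2": (1 + power(5)) % q == power(2),
    }
```

Every entry of both dictionaries evaluates to True, so the addition table forced on $\{0\} \cup \langle a\rangle$ in each case agrees with that of the corresponding prime field.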

(iii) Suppose that 1 + 1 = a, where a has order 6. Then n is odd, and 
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a = (— l)fc, where b has order 3. But then, by Theorem B(iv), 1 + ft" 1 = 

( — l)b, a contradiction since b =/= 1. 

(iv) Suppose that $n - 1 = 4p$, where $p$ is a prime $\ge 3$. Then, since $R^*$ is
nonabelian and contains a unique involution, $R^* = \langle a, b \mid a^4 = 1 = b^p,\
a^{-1}ba = b^{-1}\rangle$. The subgroup $N = \langle a^2, b\rangle$ is of index 2 in $R^*$, and $R^* - N$
consists of all elements of $R^*$ that have order 4. If $\Sigma$ is a proper $\Sigma$-set, then by
the lemma, $|\Sigma \cap (R^* - N)| = 0$ or 4. Since $|R^* - N| = 2p \equiv 2 \pmod 4$, it
follows that there exist elements $c, c^{-1}$ of order 4 which belong to a $\Sigma$-set $\Sigma_1$
that is not proper. Clearly, $\Sigma_1$ is of type (iv). Hence $1 + 1$ is of order 4.

(v) Suppose $n - 1 = 6p$, where $p$ is a prime $> 3$. Since $R^*$ contains the
unique involution $-1$ and since $3p$ is odd, $R^* = \langle -1\rangle \times S$ with $|S| = 3p$.
Since $p > 3$, the Sylow $p$-group of $S$ is normal, and since $R^*$ is nonabelian, it
follows that $p \equiv 1 \pmod 3$ and that $S$ has $p$ Sylow 3-groups. Hence $S$ contains
$p$ $\Sigma$-sets of type (ii). Let $n_1$, $n_2$ be the numbers of elements in $S$, $R^* - S$,
respectively, which are contained in proper $\Sigma$-sets. If $1 + 1 = -1$, then
$n_1 = p - 1$ and $n_2 = 3p - 1$ since $R^* - S$ contains no element of order 3. If
$(-1)(1 + 1) = c \ne 1$, then $n_1 = p - 3$ and $n_2 = 3p - 1$ if $c \in S$, while
$n_1 = p - 1$ and $n_2 = 3p - 3$ if $c \notin S$. Hence $n_1 \le p - 1$ and $n_2 \ge 3p - 3$. But,
by the lemma, if $\Sigma$ is a proper $\Sigma$-set such that $|\Sigma \cap (R^* - S)| > 0$, then $\Sigma$
contains four elements of $R^* - S$ and two elements of $S$. Hence $n_2 \le 2n_1$.
This yields the contradiction $3p - 3 \le 2(p - 1)$.

(vi) Suppose that $n$ is even and that $R^* = \langle a, N\rangle$ where $a^3 = 1$ and $N$ is
a normal subgroup of index 3 in $R^*$. We can assume that $|N| > 1$, since
otherwise $\pi$ is of order 4, hence desarguesian. Suppose that $a$ acts fixed point
free on $N$, i.e., that $a^{-1}xa = x$ with $x \in N$ implies $x = 1$. Then $|N| \equiv
1 \pmod 3$, and $\langle a\rangle$ is a Sylow 3-group of $R^*$. Now $xax^{-1} = a$ with $x \in N$
implies $x = 1$, and $xax^{-1} = a^{-1}$ implies $x^2ax^{-2} = a$, whence $x = 1$ because
$n$ is even. Hence $R^*$ contains $|N|$ Sylow 3-groups, and every element of $R^*$
either belongs to $N$ or has order 3. By Theorem B(iv) if $c$ is of order 3, then
$1 + c = c^{-1}$ since here $-1 = 1$. It follows that if $x \in N$, $x \ne 1$, then
$1 + x \in N$. Thus, again because $1 + 1 = 0$, the set $\{0\} \cup N$ is closed under
addition and multiplication, and must therefore coordinatize a subplane of
order $1 + |N|$. By Bruck’s theorem [2, p. 145, 18], this implies that
$(1 + |N|)^2 \le 1 + 3|N|$, i.e., that $|N| = 1$, a contradiction.

(vii) Suppose $p$ is a prime $> 3$. If $n - 1 = 3p$, the Sylow $p$-group $N$ of
$R^*$ is normal and of index 3, and $R^* = \langle a, N\rangle$ where $a$ is any element of
order 3. Since $|N| = p$ and since $a$ cannot act fixed point free on $N$ by (vi),
$R^*$ is abelian.

(viii) Suppose $p \equiv -1 \pmod 3$ and that $n - 1 = 3p^2$. Again, the Sylow
$p$-group $N$ of $R^*$ is normal and of index 3, and $R^* = \langle a, N\rangle$ where $a$ has
order 3. The centralizer of $a$ in $N$ is nontrivial by (vi), and cannot be of order
$p$ since then $a$ would act fixed point free on the $p^2 - p$ remaining elements of
$N$, where $p^2 - p \equiv 2 \pmod 3$. Hence $a$ centralizes $N$, and, since $N$ is abelian,
$R^*$ is abelian.

Theorem 4 There is no plane $\pi$ of class 13 and of order $n$ such that $(R, +)$
has both inverse properties for the following values of $n \le 250$: n = 13, 25*, 28,
29, 43, 45, 76, 79, 111, 112, 115, 117, 163, 187, 223, 244.

Proof The values $n = 45, 111, 117$ are excluded by Theorem B(xii), and
the values $n = 28, 163, 244$ by Theorem B(vii). If $n = 43, 79, 115, 187$, or 223
we can apply Theorem 3(v). If $n = 13$ or 29, then by Theorem 3(iv) and (i), $\pi$
contains a subplane of order 5; in each case this contradicts Bruck’s
theorem. The values $n = 76, 112$ are excluded by Theorem 3(viii), (vii),
respectively. Thus it remains to be shown that $n \ne 25$.
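
The arithmetic behind this case split can be reproduced as follows. This is a sketch with hypothetical function names; it assumes the standard form of Bruck's theorem (a plane of order $n$ with a proper subplane of order $m$ has $n = m^2$ or $n \ge m^2 + m$), which the text cites only by reference, and it applies the numerical conditions of Theorems B and 3 in the order used in the proof.

```python
from math import isqrt


def is_prime(m):
    """Primality by trial division."""
    if m < 2:
        return False
    return all(m % d for d in range(2, isqrt(m) + 1))


def is_power_of_3(m):
    """True if m = 3^r for some r >= 1."""
    if m < 3:
        return False
    while m % 3 == 0:
        m //= 3
    return m == 1


def exclusion_reason(n):
    """Name the result that excludes order n, or None if none of them applies."""
    m = n - 1
    if n % 24 in (15, 21):
        return "Theorem B(xii)"
    if is_power_of_3(m) or (m % 2 == 0 and is_power_of_3(m // 2)):
        return "Theorem B(vii)"
    if m % 6 == 0 and m // 6 > 3 and is_prime(m // 6):
        return "Theorem 3(v)"
    # Theorem 3(iv), (i) give a subplane of order 5; Bruck then needs n = 25 or n >= 30.
    if m % 4 == 0 and m // 4 >= 3 and is_prime(m // 4) and n != 25 and n < 30:
        return "Theorem 3(iv), (i)"
    if m % 3 == 0:
        q = m // 3
        if q > 3 and is_prime(q):
            return "Theorem 3(vii)"
        root = isqrt(q)
        if root * root == q and root % 2 == 1 and is_prime(root) and root % 3 == 2:
            return "Theorem 3(viii)"
    return None
```

Applied to the orders listed in Theorem 4, the function returns a reason for every value except $n = 25$, which is the case that needs the separate argument.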

If $n = 25$, then, since $R^*$ contains a unique involution, there are four
nonisomorphic possibilities for $R^*$ [1, §126]. These are:

(i) $\langle a, b, c \mid a^4 = b^4 = 1,\ a^2 = b^2,\ b^{-1}ab = a^{-1},\ c^3 = 1,\ a^{-1}ca = c,\ b^{-1}cb = c \rangle$,

(ii) $\langle a, b, c \mid a^4 = b^4 = 1,\ a^2 = b^2,\ b^{-1}ab = a^{-1},\ c^3 = 1,\ a^{-1}ca = c,\ b^{-1}cb = c^{-1} \rangle$,

(iii) $\langle a, b, c \mid a^4 = b^4 = 1,\ a^2 = b^2,\ b^{-1}ab = a^{-1},\ c^3 = 1,\ c^{-1}ac = b,\ c^{-1}bc = ab \rangle$,

(iv) $\langle a, b \mid a^8 = 1 = b^3,\ a^{-1}ba = b^{-1} \rangle$.

We note that the regular and irregular near-fields of order 25 have
multiplicative groups of type (iv) and (iii), respectively.

Suppose first that $R^*$ is of type (i) or (ii). Then, in each case, $R^*$ contains
exactly two elements of order 3, namely $c$ and $c^{-1}$, and the group
$N_1 = \langle a, c \rangle$ is of index 2 in $R^*$. Since $|R^* - N_1| = 12 \equiv 0 \pmod 4$, and
since $1, -1, c, c^{-1} \in N_1$, it follows from the lemma that each Z-set that
contains an element of $R^* - N_1$ is proper; there are three such sets, each
containing two elements of $N_1$. The remaining six elements of $N_1$ include 1,
$-1$, $c$, and $c^{-1}$. Since $\{c, c^{-1}\}$ is a Z-set of type (ii), we cannot have
$1 + 1 = -1$. Hence $N_1$ contains a Z-set of type (iv), say $\{1, d, d^{-1}\}$, with
$1 + d = -1$, where also $1 + 1 = (-1)d$. Now $N_2 = \langle b, c \rangle$ is also of index 2
in $R^*$ for both types, and the same argument shows that $\{1, d, d^{-1}\} \subset N_2$.
Since $N_1 \cap N_2 = \langle -1 \rangle \times \langle c \rangle$, the element $d$ is of order 3 or 6, whence
$1 + 1$ is of order 6 or 3. By Theorem 3(iii), $1 + 1$ is not of order 6. If $1 + 1$ is
of order 3, then by Theorem 3(ii), $\pi$ contains a subplane of order 7; this
contradicts Bruck's theorem. Hence $R^*$ is not of type (i) or (ii).

Suppose next that $R^*$ is of type (iii). Write $z = a^2 = b^2 = -1$. Then,
besides 1 and $z$, $R^*$ contains six elements of order 4, eight elements of order
3, and eight elements of order 6. The elements of order 3 form four Z-sets of
type (ii), and it follows from the lemma that there exists a Z-set $\{1, d, d^{-1}\}$ of
type (iv), with $1 + 1 = zd$. As has just been shown, $d$ cannot be of order 3 or
6. Hence $1 + 1$ is of order 4, and we can assume without loss of generality
that $1 + 1 = a$. We are left with four elements of order 4 and eight elements
of order 6 with which to construct two proper Z-sets, say $Z_1$ and $Z_2$. The
elements of order 4 cannot all be in the same Z-set. So suppose that
$b, b^{-1} \in Z_1$ and that $ab, a^{-1}b \in Z_2$. Then there exists an element $x$ of order
6 such that $b + x = z$. If $x = zc$, then $b^{-1} + b^{-1}zc = z$, i.e., $b^{-1} + bc = z$,
and this is impossible because $bc$ has order 3. Similarly, we find that
$x \neq bc^{-1}, a^{-1}bc, abc^{-1}$. Hence either (1) $b + zc^{-1} = z$, (2) $b + ac^{-1} = z$, (3)
$b + b^{-1}c = z$, or (4) $b + a^{-1}c = z$.

In case (1), $Z_1 = \{b, b^{-1}, zc^{-1}, zc, bc^{-1}, a^{-1}c\}$, whence $Z_2 = \{ab, a^{-1}b,
ac^{-1}, b^{-1}c, a^{-1}bc, abc^{-1}\}$. There exists an element $y \in Z_2$ such that $y$ has
order 6 and $ab + y = z$. If $y = ac^{-1}$, then $1 + b^{-1}c^{-1} = b^{-1}a^{-1}z$, which
contradicts the fact that $b^{-1}c^{-1}$ has order 3. Similarly, $y \neq abc^{-1}$. If
$y = a^{-1}bc$, then $1 + zc = b^{-1}a^{-1}z$. But $b + zc^{-1} = z \Rightarrow z + c^{-1} = b \Rightarrow
zc + 1 = cb$. This is a contradiction, since $b^{-1}a^{-1}z \neq cb$. Similarly,
$y \neq b^{-1}c$. Hence case (1) cannot occur. The same kind of calculations show
that case (2) cannot occur, and that in case (3) the only possibility is
$ab + bc^{-1} = z$. The loop obtained in case (3) is not planar since, for example,
$(1 + b)^{-1}(a + b) = a^{-1}bc = (1 + a^{-1}c^{-1})^{-1}(a + a^{-1}c^{-1})$. Finally, in
case (4), we find that the only possibility is $ab + zc^{-1} = z$; this must yield the
plane of class IVa2 over the irregular near-field of order 25 since this plane
must appear as a solution. Hence there is no plane of class 13 with $R^*$ of type
(iii).

Suppose, finally, that $R^*$ is of type (iv). Write $a^4 = z = -1$. Then, in
addition to 1 and $z$, $R^*$ contains two elements of order 3, two elements of
order 4, two elements of order 6, four elements of order 12, and 12 elements
of order 8. Let $N = \langle a^2, b \rangle$; then $N$ is of index 2 in $R^*$ and all elements of
$R^* - N$ are of order 8. By the lemma, since $|R^* - N| = 12$, the element $1 + 1$
must belong to $N$, and there are three proper Z-sets that contain elements of
$R^* - N$. Since these use altogether six elements of $N$, and since
$\{b, b^{-1}\} \subset N$ is the only Z-set of type (ii), there exists a Z-set
$\{1, d, d^{-1}\} \subset N$ of type (iv), with $1 + 1 = zd$. It was shown earlier that $d$
cannot be of order 3 or 6. Hence $d$ has order 12 or 4. If $d$ has order 12, there
exists a Z-set $Z$ that contains four elements of order 8 and the two elements
of order 4. Moreover, there exist $x, y \in Z$ such that $x, y$ have order 8,
$x + y = z$ and $x^{-1}y$ has order 4. It follows that $y = x^{-1}$ or $x^3$. Since
$x + x^{-1} = z$ only if $x^3 = 1$, we must have $x + x^3 = z$. Then
$1 + x^2 = x^{-1}z = x^3$. Let $1 + 1 = t$, so that $t = zd$ has order 12. Then
$x^2 + x^2 = x^2t$. By planarity, $(x^2 + 1)^{-1}(1 + 1) \neq (x^2 + x^2)^{-1}(1 + x^2)$,
i.e., $x^5t \neq t^{-1}x$. But, since $t$ has order 12, $t = x_1b_1 = b_1x_1$ where
$x_1 = x^2$ or $x^6$ and $b_1 = b$ or $b^{-1}$. Thus $x^5t = zxx_1b_1$ and
$t^{-1}x = x_1^{-1}b_1^{-1}x = x_1^{-1}xb_1 = zxx_1b_1$. Hence $1 + 1$ is not of order 12.

We can now assume that $1 + 1$ is of order 4. In this case there exists a
proper Z-set $Z_1$ that contains four elements of order 8 and the two elements
of order 6. Since Aut $R^*$ is transitive on the elements of order 8 and on the
elements of order 3, we can assume without loss of generality that $a, zb \in Z_1$
and that $a + zb = z$. Then $Z_1 = \{a, a^{-1}, zb, zb^{-1}, a^3b, a^5b\}$. (Note that we do
not specify at this stage whether $1 + 1 = a^2$ or $a^6$ since this would
distinguish between $a$ and $a^{-1}$ in $(R, +)$.) There are two other proper Z-sets,
$Z_2$ and $Z_3$, each of which contains four elements of order 8 and two elements
of order 12. Since neither $Z_2$ nor $Z_3$ can contain four elements from the same
Sylow 2-group of $R^*$, we can assume that $a^3, a^5, a^2b^{-1}, a^6b^{-1} \in Z_2$. Then
either (1) $a^3b^{-1}, a^5b^{-1} \in Z_2$ or (2) $ab^{-1}, a^{-1}b^{-1} \in Z_2$.

In case (1) we have either (i) $a^3 + a^5b^{-1} = z$ or (ii) $a^5 + a^3b^{-1} = z$ (since
$a^2b^{-1}, a^6b^{-1} \in Z_2$). Furthermore, $Z_3 = \{ab^{-1}, a^{-1}b^{-1}, ab, a^{-1}b, a^2b, a^6b\}$,
where either (iii) $ab^{-1} + a^{-1}b = z$ or (iv) $a^{-1}b^{-1} + ab = z$. Now (i) and (iv)
are incompatible since (i) implies $1 + a^5 = a^6b^{-1}$, while (iv) implies
$1 + ab^{-1} = a^6b^{-1}$. Similarly, (ii) and (iii) are incompatible since (ii) implies
$1 + a^3 = a^2b^{-1}$ while (iii) implies $1 + a^{-1}b^{-1} = a^2b^{-1}$. If (i) and (iii) hold,
we find $(a^2 + a^{-1})^{-1}(1 + a^{-1}) = a^{-1} = (a^2 + ab)^{-1}(1 + ab)$; if (ii) and (iv)
hold, then $(a^2 + a^5)^{-1}(1 + a^5) = a^3b = (a^2 + a^2b)^{-1}(1 + a^2b)$. Thus the
loops obtained from case (1) are nonplanar.

In case (2) we find by considering $Z_2$ that either (v) $a^3 + ab^{-1} = z$ or (vi)
$a^5 + a^{-1}b^{-1} = z$, and by considering $Z_3$ that either (vii) $ab + a^3b^{-1} = z$ or
(viii) $a^{-1}b + a^5b^{-1} = z$. Here it turns out that (v) and (vii) are incompatible,
as also are (vi) and (viii). If (v) and (viii) hold, then
$(a^2 + a^3)^{-1}(1 + a^3) = a^3 = (a^2 + a^5b^{-1})^{-1}(1 + a^5b^{-1})$, and no plane is
obtained. So suppose that (vi) and (vii) hold. In order to determine the
complete loop $(R, +)$, we must define either $1 + 1 = a^2$ or $1 + 1 = a^6$. But if
$1 + 1 = a^2$, then $(a + a^{-1})^{-1}(1 + a^{-1}) = a^2b = (a + b^{-1})^{-1}(1 + b^{-1})$. Hence,
to within isomorphism, there exists only one planar loop $(R, +)$ that is
compatible with $R^*$, and this must yield the plane of class IVa2 over the
regular near-field of order 25. This completes the proof that there is no plane
of class 13 and of order 25 that satisfies the hypotheses of Theorem 4.
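
The element-order counts used in the type (iv) argument can be checked mechanically. The sketch below is an illustration only: it models the group $\langle a, b \mid a^8 = 1 = b^3,\ a^{-1}ba = b^{-1} \rangle$ by pairs $(i, j)$ standing for $a^ib^j$ (a normal form chosen here for convenience, not taken from the text) and tabulates the orders.

```python
from collections import Counter


def mul(x, y):
    """Product (a^i b^j)(a^k b^l) in <a, b | a^8 = b^3 = 1, a^-1 b a = b^-1>."""
    (i, j), (k, l) = x, y
    # b^j a^k = a^k b^(j * (-1)^k), because conjugation by a inverts b.
    sign = 1 if k % 2 == 0 else -1
    return ((i + k) % 8, (sign * j + l) % 3)


def order(g):
    """Order of g, found by repeated multiplication until the identity (0, 0) is reached."""
    n, power = 1, g
    while power != (0, 0):
        power = mul(power, g)
        n += 1
    return n


elements = [(i, j) for i in range(8) for j in range(3)]
order_counts = Counter(order(g) for g in elements)
involutions = [g for g in elements if order(g) == 2]
```

The tabulation gives one involution ($a^4 = z$), two elements each of orders 3, 4, and 6, four of order 12, and twelve of order 8, which accounts for all 24 elements.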

By Theorems 2 and 4, there are only 12 nonprime powers < 250 that are
still “possible” orders for a plane that satisfies the conditions of Theorem 4;
the three smallest of these are 85, 145, and 148. It is perhaps more significant
that there is no plane of this kind with order 25. However, for $n = p^2$, where
$p$ is a prime $> 3$, it may well make a difference whether $p \equiv 1$ or 3 (mod 4)
(since this determines whether the Sylow 2-groups of $R^*$ are cyclic or
generalized quaternion for the regular near-field), and whether $p \equiv 1$ or 2 (mod 3)
(since the elements of order 3 clearly play an important role in limiting the
possible Z-sets). On both counts, and for other obvious reasons, the case
$n = 49$ is of particular interest; however, it is probably too large to handle by
the methods used here. In principle, the question of the existence of planes of
class 13 with “small” order could be settled by computer (cf. Pankin [8]). The
assumption that $(R, +)$ has both inverse properties reduces the number of
additive loops substantially. (Even $n = 13$ would be very laborious to decide
by hand in the general case.) However, there is no reason to suppose that
finite planes of class 13 must satisfy the assumption.

Introduction

A cyclic algebra was defined by L. E. Dickson as a hypercomplex system
$H$ generated by two elements $a$, $b$ over a field $F$ such that:

(1) the subalgebra of $H$ generated by $a$ alone is a cyclic extension $E$ of
finite degree $n$ over $F$ where the automorphism group of $E$ over $F$ is
generated by a certain automorphism $\sigma$ of order $n$;

(2) $b$ is invertible and

$b^{-1}ab = a\sigma$. (1)

Olga Taussky studied in [2-4] the question of what one can say about the
reduced norm of the commutator

$c = [\alpha, \delta] = \alpha\delta - \delta\alpha$ ($\alpha \in F(a)$, $\delta \in F(a) + F(a)b$) (2)

in the case that $n = 2$ and $H$ is a matrix algebra over $F$. By her results she
established an interesting relation between the multiplicative commutator
equation

$(a, b) = a^{-1}b^{-1}ab = a^{-1}(a\sigma)$ (3a)

and the little studied additive commutator equation motivating the present
investigation.

Let us mention right away that for arbitrary $\alpha, a_1, a_2$ of $F(a)$ and
elements $a_1 + ba_2$ of $F(a) + bF(a)$ there holds the commutator equation

$[\alpha, a_1 + ba_2] = ba_2(\alpha\sigma - \alpha)$ (3b)

where the elements on the right-hand side of (3a) are characterized as the
elements of $E$ of norm 1 according to Hilbert Satz 90, whereas the elements
$\alpha\sigma - \alpha$ occurring on the right of (3b) of the additive version are
characterized as the trace zero elements of $E$.
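
For $n = 2$ the commutator equation can be tested numerically in a split model. The following sketch rests on assumptions that are not in the text: $F = \mathbb{Q}$, $E = \mathbb{Q}(\sqrt{2})$ embedded in the $2 \times 2$ matrices through the companion matrix of $t^2 - 2$, and $b = \mathrm{diag}(1, -1)$, so that $b^{-1}ab = -a = a\sigma$ and $b^2 = 1$. All helper names are hypothetical.

```python
D = 2  # assumption: E = Q(sqrt(2)), so n = 2 and a^2 = D


def mat_mul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)] for i in range(2)]


def mat_add(X, Y):
    return [[X[i][j] + Y[i][j] for j in range(2)] for i in range(2)]


def mat_sub(X, Y):
    return [[X[i][j] - Y[i][j] for j in range(2)] for i in range(2)]


def det(X):
    return X[0][0] * X[1][1] - X[0][1] * X[1][0]


def embed(x, y):
    """Matrix of x + y*sqrt(D), i.e. x*I + y*a with a = [[0, 1], [D, 0]]."""
    return [[x, y], [D * y, x]]


def conj(x, y):
    """Matrix of the conjugate (x + y*sqrt(D)) sigma = x - y*sqrt(D)."""
    return embed(x, -y)


B = [[1, 0], [0, -1]]  # b with b^-1 a b = -a and b^2 = 1


def both_sides(alpha, a1, a2):
    """Return ([alpha, a1 + b a2], b a2 (alpha sigma - alpha)) as 2x2 integer matrices."""
    A = embed(*alpha)
    delta = mat_add(embed(*a1), mat_mul(B, embed(*a2)))
    lhs = mat_sub(mat_mul(A, delta), mat_mul(delta, A))
    rhs = mat_mul(mat_mul(B, embed(*a2)), mat_sub(conj(*alpha), A))
    return lhs, rhs


lhs, rhs = both_sides((3, 1), (5, 2), (1, 4))
```

In this instance both sides agree, and the determinant of the commutator equals $\det(b) \cdot N(a_2) \cdot N(\alpha\sigma - \alpha) = (-1)(-31)(-8)$, a product of the norm of $b$ and two norms from $E$.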

1. Simple Properties of Cyclic Algebras 

It follows from the definition of a cyclic algebra $H$ that $H$ is a centrally
simple associative algebra with the $n^2$ basis elements $a^ib^k$ ($0 \le i < n$,
$0 \le k < n$), that

$0 \neq b^n = \beta \in F$ (4)

and that $\beta$ is uniquely determined by $H$ and the structure of $E$ over $F$
modulo the norm group $N_{E/F}(E \setminus 0)$, the multiplicative group formed by all
nonzero elements of $F$ that are norms of some element of $E$ over $F$.

The algebra $H$ is a ring of matrices of degree $n$ over $F$ precisely if $\beta$ is in
the norm group.

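Indeed, if $a$ satisfies the irreducible equation $f(a) = 0$ where

$f(t) = t^n - \alpha_1t^{n-1} + \cdots + (-1)^n\alpha_n$ ($\alpha_i \in F$, $1 \le i \le n$) (5)

is some irreducible polynomial of degree $n$ over $F$ and if

$a^{i-1}\sigma = \sum_{k=1}^{n} \beta_{ik}a^{k-1}$ ($\beta_{ik} \in F$; $1 \le i \le n$, $1 \le k \le n$), (6)

then the companion matrix

$$\begin{pmatrix} 0 & 1 & & \\ & \ddots & \ddots & \\ & & 0 & 1 \\ (-1)^{n-1}\alpha_n & \cdots & & \alpha_1 \end{pmatrix}$$

and the matrix $b = (\beta_{ik})$ generate the full matrix algebra of degree $n$ over $F$
presented as a cyclic algebra such that $\beta = 1$, unique up to a norm factor.

In general there is the split model of $H$ generated by the two matrices

$$a = \begin{pmatrix} \alpha & & & \\ & \alpha\sigma & & \\ & & \ddots & \\ & & & \alpha\sigma^{n-1} \end{pmatrix}, \qquad b = \begin{pmatrix} 0 & \beta_1 & & \\ & 0 & \beta_2 & \\ & & \ddots & \ddots \\ & & & \beta_{n-1} \\ \beta_n & 0 & \cdots & 0 \end{pmatrix}$$

with coefficients $\alpha, \alpha\sigma, \ldots, \alpha\sigma^{n-1}, \beta_1, \ldots, \beta_n$ in some splitting extension $\Phi$ of $f$
over $F$ such that

$f(t) = \prod_{j=1}^{n} (t - \alpha\sigma^{j-1})$, (7a)

$\beta = \beta_1\beta_2 \cdots \beta_n$, (7b)

which gives a concrete model of the cyclic algebra $H$ over $F$.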

As is well known, any finite-dimensional centrally simple associative 
algebra H over the field F contains maximal commutative subalgebras that 
are finite separable extensions of F. In particular, if H is of dimension 4 over 
F, then H contains a separable quadratic extension E of F. This means that 
in the latter case $E$ is a cyclic extension with involutory generating
automorphism $\sigma$ over $F$ such that there is some invertible element $b$ of $H$ so that for
any generator $a$ of $E$ over $F$ we have (1) and $H$ is cyclic.

On the other hand, it has been shown recently that not every centrally 
simple hypercomplex system is a cyclic algebra. 

We observe that

$H = \sum_{k=0}^{n-1} b^kE$ (8a)

and according to (1)

$ab^k - b^k(a\sigma^k) = 0$ (8b)

so that

$[b^k, a] = b^ka - ab^k = b^k(a - a\sigma^k)$, (8c)

$[E, b^kE] = b^k \ker \mathrm{tr}$ ($k \in \mathbb{Z}$, $\gcd(k, n) = 1$) (8d)

where tr is the $F$-linear trace mapping

$\mathrm{tr}: E \to F$, $x\,\mathrm{tr} = \sum_{j=0}^{n-1} x\sigma^j$ ($x \in E$)

of $E$ on $F$. Note that the well-known additive version of Hilbert Satz 90 just
tells us that

$E(1 - \sigma) = \ker \mathrm{tr}$. (8e)

In the event that the degree $n$ of $E$ over $F$ is not divisible by the
characteristic of $F$ a considerable simplification of the problem is achieved by
adjoining to the field of reference $F$ a primitive $n$th root of unity $\zeta$ such that over
the new field of reference $F_1 = F(\zeta)$ there arises the generalized cyclic
algebra $H_1 = F_1 \otimes_F H = F_1H$ containing the separable normal commutative
algebra $E_1 = F_1 \otimes_F E = F_1E = F_1(a)$ of dimension $n$ over $F_1$ with
generating automorphism

$\sigma_1: E_1 \to E_1$, $(\alpha \otimes_F \eta)\sigma_1 = \alpha \otimes_F (\eta\sigma)$ ($\alpha \in F_1$, $\eta \in E$)

and with standard generators $a$, $b$.

The simplification is achieved by Lagrange's construction according to
which

$E_1 = F_1(a_1)$, $0 \neq a_1^n = \alpha \in F_1$, $|\alpha/(F_1 \setminus 0)^n| = n$, (9)

$a_1\sigma_1 = \zeta a_1$ (10)

and Eq. (1) assumes the form

$b^{-1}a_1b = \zeta a_1$ (11)

or, without inversion,

$a_1b - \zeta ba_1 = 0$. (12)

The latter form of condition (11) has the virtue that for given $a_1$
the equation

$a_1x - \zeta xa_1 = 0$ (13)

defines the $n$-dimensional linear solution space $E_1b = bE_1$.

Any one of its regular elements together with any generator of $E_1$ over $F$
presents $H_1$ as a cyclic algebra.


2. A Theorem on Cyclic Algebras 

Generalizing a recent theorem of O. Taussky for $n = 2$, we have

Theorem 1 (a) For given $a$ of $E$ the norms of the Lie products

over $F$ form the set

where $\zeta$ is a primitive $n$th root of unity.

(b) The norms of the Lie products 

[S,E] 

(6 a fixed regular element of E + bE) over F run through all of 

fi)pN(F). 

(c) For any order A of H over the integral domain A n F with quotient 
field F and for given a of A n E the norms of the Lie products 

[x, a] (x g (E 4- bE) n A) 

over F form the set 

tj<2)pN(a — ckt){N(x)\x e 91 } 
when 91 is the A n E-ideal 

Sa = {y\ye (((b- l E + E) n b~ 1 A) + b~ l E) n £)}. 

Proof Note that the minimal equation for $b$ over $F$ is

$b^n - \beta = 0$

so that

$N(b) = (-1)^{n-1}\beta$.

The remainder follows from (8b)-(8d).

O. Taussky's theorem establishes a striking relationship between any two
quadratic subfields of the ring of $2 \times 2$ matrices over the field $F$: the
determinant of the commutator of any pair of generators of the two subfields
is equal to a negative norm into $F$ from each of the two extensions. The
theorem above generalizes this only to pairs of subfields of the ring of
$n \times n$ matrices over the field $F$ such that one of them is a radical extension
generated by an element transforming the other extension according to an
automorphism of order $n$.

In order to test the general relationship between the cyclic extension F(a) 
and the norm of the commutator [a, 6] where 

6= X a k bk ~ S a k = Pk(<*)> <* = P( a Y> p k(t) e F[t ], P(t) e F[t ]; 1 < k < n 

k= o
we use the split model with $\beta_1 = \beta_2 = \cdots = \beta_n = \gamma$



OLG 


'0 

1 

■ 


a (7 2 



0 1 


a = 


II 

-0 



1 


0CG n 


1 

0 ••• 

0 


where we assume y to be a solution of the equation 

7 n = P (14a) 

It follows that 


[a, 6] = £ ((P(ao-‘) - P(a.<j k ))P (i _ k) Xa.o i )y {i k) ") 

i, k 


// • 1 \ \i — k 

if 

i-k>0 4 , \ 


if 

l<i<n 9 l<k<n\ 
1 — k < 0, / 


so that the norm of the commutator of $\alpha$, $\delta$ appears as the determinant of the
matrix on the right, which turns out to be a homogeneous form of degree $n$
in both the coefficients of $P$ as well as (jointly) in the coefficients of the $P_k$. In
the cubic case there appears the simple result

$N([\alpha, a_0 + ba_1 + b^2a_2]) = N(\alpha\sigma - \alpha)(\beta N(a_1) - \beta^2N(a_2))$

($a_0, a_1, a_2 \in E$; $n = 3$). (14b)

In general there appears a norm form with many terms.

A deeper question concerns the Lie algebra characteristics of the direct
decomposition (8a). This question will be discussed in Section 3.


3. The Spectral Decomposition Belonging to Maximal Commutative 
Subalgebras 

It is well known that a ring $H$ always is a bimodule relative to any
subring $E$ of $H$.

Let us follow up this remark by a more detailed analysis in case $H$ is a
unital hypercomplex system over a field $F$ and $E$ is a commutative
$F$-subalgebra containing $1_H$.

In that case we interpret $H$ as a representation space of the tensor
algebra $E^{(2)} = E \otimes_F E$, a unital commutative hypercomplex system over $F$,
according to the operational rule

$\Lambda: E^{(2)} \to \mathrm{End}_F H$, $h((a_1 \otimes a_2)\Lambda) = a_1ha_2$ ($h \in H$; $a_1, a_2 \in E$). (15a)

Let $e_1, e_2, \ldots, e_s$ be the primitive idempotents of $E$. It follows that the
idempotents $e_i \otimes e_k$ of $E^{(2)}$ form a system of orthogonal idempotents of $E^{(2)}$
adding up to $1_{E^{(2)}} = 1 \otimes 1$. Accordingly, we obtain the decomposition

$H = \sum_{i=1}^{s} \sum_{k=1}^{s} e_iHe_k$ (15b)

of $H$ into the direct sum of the $E$-bimodules $e_iHe_k$ which are characterized as
the set of all elements of $H$ with $e_i$ as left identity and $e_k$ as right identity.

The multiplicative interaction of the components is given by the rule

$(e_iHe_k)(e_{i'}He_{k'}) \subseteq \delta_{ki'}e_iHe_{k'}$ ($i, k, i', k' = 1, 2, \ldots, s$), (15c)

which implies the Lie interaction rule

$[e_iHe_k, e_{i'}He_{k'}] \subseteq \delta_{ki'}e_iHe_{k'} + \delta_{k'i}e_{i'}He_k$. (15d)

We remark that the subalgebras $e_iHe_i$ of $H$ are certainly simple
$F$-subalgebras in the event that $H$ is centrally simple over $F$.
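
The interaction rule for the components $e_iHe_k$ is easy to test in the simplest split situation. The sketch below assumes, purely for illustration, that $H$ is the ring of $3 \times 3$ integer matrices and that $E$ is the diagonal subalgebra, whose primitive idempotents are the diagonal matrix units; none of these choices is taken from the text.

```python
import itertools
import random

S = 3  # assumption: H = 3x3 matrices, E = diagonal matrices, so s = 3


def unit(i):
    """Primitive idempotent e_i of E: the diagonal matrix unit E_ii."""
    return [[1 if r == c == i else 0 for c in range(S)] for r in range(S)]


def mat_mul(X, Y):
    return [[sum(X[r][m] * Y[m][c] for m in range(S)) for c in range(S)] for r in range(S)]


def component(i, h, k):
    """Projection of h onto the component e_i H e_k."""
    return mat_mul(mat_mul(unit(i), h), unit(k))


def is_zero(X):
    return all(v == 0 for row in X for v in row)


def interaction_rule_holds(seed=0):
    """Check (e_i H e_k)(e_i' H e_k') <= delta_{k i'} e_i H e_k' on random integer matrices."""
    rng = random.Random(seed)
    h1 = [[rng.randint(-5, 5) for _ in range(S)] for _ in range(S)]
    h2 = [[rng.randint(-5, 5) for _ in range(S)] for _ in range(S)]
    for i, k, i2, k2 in itertools.product(range(S), repeat=4):
        prod = mat_mul(component(i, h1, k), component(i2, h2, k2))
        if k != i2 and not is_zero(prod):
            return False
        if k == i2 and prod != component(i, prod, k2):
            return False
    return True
```

Products of components with mismatched inner indices vanish, and the remaining products stay inside $e_iHe_{k'}$, in line with the stated rule.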

In order to make further progress assume now that:

(1) $H$ is centrally simple over $F$, of degree $n$;

(2) $E$ is a subfield of $H$ that is a maximal commutative subring, in other
words $E$ is an extension of $F1_H$;

(3) $E$ is separable over $F$.

It follows that $E^{(2)}$ is semisimple. Hence there are finitely many primitive
idempotents $d_1, \ldots, d_q$ of $E^{(2)}$ that are orthogonal adding up to 1. $E^{(2)}$ is the
direct sum of the minimal ideals $E^{(2)}d_k$ ($1 \le k \le q$). Correspondingly, we find
the decomposition

$H = \sum_{k=1}^{q} Hd_k$ (15e)

of $H$ into the direct sum of the $E$-bimodules $Hd_k$ which are characterized as
the set of all elements of $H$ with $d_k$ as right identity. Let us number the $d_k$ in
such a way that the kernel of the diagonal $E$-epimorphism

$\delta: E^{(2)} \to E$, $(a_1 \otimes a_2)\delta = a_1a_2$ ($a_1, a_2 \in E$)

coincides with the maximal ideal

$\sum_{k=2}^{q} E^{(2)}d_k$. (15f)

It follows that $\delta$ restricts on $E^{(2)}d_1$ to an $F$-isomorphism onto the
$E$-bimodule containing $E$.

As a matter of fact $Hd_1$ consists of all elements of $H$ centralizing $E$ so that
by assumption

$Hd_1 = E \cong E^{(2)}d_1$.

The $E$-bimodule $H$ is operator isomorphic to the $E$-bimodule $E^{(2)}$.
Indeed, there is a regular element $x$ of $H$ for which $x^{-1}Ex \cap E = F$. Hence

$H = x^{-1}ExE = ExE$

and the mapping

$\iota: E^{(2)} \to H$, $(a_1 \otimes_F a_2)\iota = a_1xa_2$ ($a_1, a_2 \in E$)

provides an $E^{(2)}$-operator isomorphism.

In order to discuss the multiplicative interaction of the components let us
begin with the case that $E$ is a normal extension, which is tantamount to
$q = n$ or, equally well, to the statement that each component $Hd_k$ is
$n$-dimensional over $F$. Denoting by $G$ the automorphism group of $E$ over $F$ we
may number the primitive idempotents of $E^{(2)}$ by the elements of $G$, say $d(\sigma)$
($\sigma \in G$) in such a way that

$a\,d(\sigma) = d(\sigma)(a\sigma)$ ($a \in E$), (15g)

$E\,d(\sigma) = d(\sigma)E = E^{(2)}d(\sigma)$, (15h)

$a\,d(\sigma)\iota = d(\sigma)\iota\,(a\sigma)$ ($a \in E$), (15i)

$E\,d(\sigma)\iota = d(\sigma)\iota\,E = Hd(\sigma)$, (15j)

and the elements $y$ of the component $Hd(\sigma)$ of the direct decomposition
(15e) are characterized by the relation

$ay = y(a\sigma)$ ($a \in E$). (15k)

Since for $\sigma$, $\tau$ of $G$ we have

$a\,d(\sigma)\iota\,d(\tau)\iota = d(\sigma)\iota\,(a\sigma)\,d(\tau)\iota = d(\sigma)\iota\,d(\tau)\iota\,(a\sigma\tau)$,

it follows that

$d(\sigma)\iota\,d(\tau)\iota = d(\sigma\tau)\iota\,c(\sigma, \tau)$ (15l)

where

$(c(\rho, \sigma)\tau)\,c(\rho\sigma, \tau) = c(\rho, \sigma\tau)\,c(\sigma, \tau)$, (15m)

$d(1_G)\iota = 1_H$, $c(1_G, 1_G) = 1_E = 1_H$, (15n)

$Hd(\sigma) \cdot Hd(\tau) = Hd(\sigma\tau)$, (15o)

$[Hd(\sigma), Hd(\tau)] \subseteq Hd(\sigma\tau) + Hd(\tau\sigma)$, (15p)

$[y, a] = y(a - a\sigma)$ ($y \in Hd(\sigma)$, $\sigma \in G$, $a \in E$). (15q)

The last two results show that the decomposition

$LH = \sum_{\sigma \in G} Hd(\sigma)$ (15r)

of the $F$-Lie algebra $LH$ associated with $H$ into the direct sum of linear
subspaces over $F$ is the spectral decomposition of $LH$ with respect to the
abelian subalgebra

$LE = Hd(1_G)$ (15s)

that is associated with $E$.

In the general case we embed $E$ into a minimal splitting extension $\Phi$ of $E$
over $F$ and we embed $\Phi$ as maximal abelian $F$-subalgebra into the ring of
matrices $H_1$ of degree $[\Phi : E]$ over $H$. Hence $E = \Phi \cap H$.

Let $G$ be the group of automorphisms of $\Phi$ over $F$, let $S = \mathrm{Aut}(\Phi/E)$
and let

$\Phi^{(2)} = \Phi \otimes_F \Phi = \sum_{\sigma \in G} \Phi^{(2)}d(\sigma)$

where the $d(\sigma)$ are primitive idempotents of $\Phi^{(2)}$ such that

$a\,d(\sigma) = d(\sigma)(a\sigma)$ ($\sigma \in G$, $a \in \Phi$)

and $\iota$ is a $\Phi^{(2)}$-isomorphism of the regular $\Phi$-bimodule $\Phi^{(2)}$ on $H_1$ mapping
$\Phi^{(2)}d(1_G)$ on $\Phi$. Hence there is the direct decomposition

$H_1 = \sum_{\sigma \in G} H_1d(\sigma)$ (16a)

of the $\Phi$-bimodule $H_1$ with components

$H_1d(\sigma) = \{y \mid y \in H_1\ \&\ \forall a(a \in \Phi \Rightarrow ay = y(a\sigma))\}$. (16b)

The tensor algebra

$E^{(2)} = E \otimes_F E = \sum_{k=1}^{q} E^{(2)}d_k$ (16c)

with primitive idempotents $d_1, \ldots, d_q$ is a subalgebra of $\Phi^{(2)}$ such that
$d_1 = d(1)$. The question is, which way is the direct decomposition of the
$E$-bimodule $H = \sum Hd_k$ related to the Remak decomposition (16a)?

Any idempotent of $\Phi^{(2)}$ is of the form

d (&)= E d W ( 16d ) 

a e 

where ft is a nonempty subset (complex) of G. The question is, which 
complexes are involved in the presentation of idempotents d k l 

Choose an F-basis b u b n of E. Any element of E {2) is of the form 
x = Yj= i x x t with Xi uniquely determined by x in E and 

x = xl = x Y, d(cr) = ( Z bi x *i ) d(d) 

ueG \i= 1 / 

n n q q k 

= E E = E E E d(S<r k a kj )(b i 0 k <T k j)x i 

i= 1 <n=C i=l k=l j= 1 

where G =(J?=iS(T t S, S(T k S =(J?=i Sff* o kj with a kj in S and E <lk — n - If 
x = d(ft), then 


" (I 

E ( b i°k<7kj)Xi = j a 

1=1 1 U 


if So k o k j S ft 
if S(r k S£Si. 


(16e) 


On the other hand, the system of linear homogeneous equations 

n 

E ( b i = y*r (I <j<q k , 1 <k<q) 

i= 1 

(1 < k! < q k ,, 1 < k f < q) 

for the unknowns x l5 ..., x„ 

has precisely one solution in O because of the separability of E over F. But 
upon application of an element o of S to the system the n left-hand sides only 
are permuted though the x, are changed to x ( a. Hence, if x, € F, then the 
permutations connected with the elements a of 5 leave the right-hand sides 
invariant. 

This means that ft is a union of several double cosets Sc k S. On the other 
hand, setting y k , jf = d kk , we find that x t a = x { for all a of S so that x t belongs 
to E and d(Sa k S) belongs to F (2) . 

It follows that the primitive idempotents of F (2) are precisely the idempo¬ 
tents d(Sa k S) corresponding to the double cosets. Hence we have 

d k = d(Sa k S) (17) 

after suitable numbering of $d_1, \ldots, d_q$. Thus the multiplication rules

$Hd_kHd_{k'} = Hd(S\sigma_kS\sigma_{k'}S)$ (18a)

and the Lie multiplication rules

$[Hd_k, Hd_{k'}] \subseteq Hd(S\sigma_kS\sigma_{k'}S) + Hd(S\sigma_{k'}S\sigma_kS)$

($1 \le k \le q$, $1 \le k' \le q$) (18b)

emerge. It should be noted that for any set of double cosets

$\{S\sigma_{k_1}S, \ldots, S\sigma_{k_m}S \mid 1 \le k_1 < k_2 < \cdots < k_m \le q\}$

we have the rules


(PM 

= Y j d(S<j kl S) 

i= 1 

(19) 

(u> s ) 

m 

= Y.Hd ki 

1 = 1 

(20) 


which tell us all that is worth knowing about the evaluation of the right-hand
sides of (18a), (18b).

4. Crossed Product Rings 

For any unital ring $R$ there is the canonical homomorphism

$\theta: U(R) \to \mathrm{Aut}(R)$, $a(x\theta) = x^{-1}ax$ ($a \in R$, $x \in U(R)$) (21)

of the unit group $U(R)$ of $R$ into the automorphism group, Aut(R), of $R$.
Its image is the inner automorphism group Inn(R) of $R$, which is normal in
Aut(R) according to the transformation rule

$\alpha^{-1}(x\theta)\alpha = (x\alpha)\theta$ ($x \in U(R)$, $\alpha \in \mathrm{Aut}(R)$). (22)

The factor group is the outer automorphism group of $R$

Out(R) = Aut(R)/Inn(R).

Suppose $R$ is contained in a unital overring $\Lambda$ such that

$1_\Lambda = 1_R$. (23a)

The normalizer of $R$ relative to the unit group $U(\Lambda)$ of $\Lambda$ is defined as
the subgroup

$N_{U(\Lambda)}(R) = \{x \mid x \in U(\Lambda)\ \&\ x^{-1}Rx = R\}$ (23b)

of $U(\Lambda)$ containing $U(R)$ as normal subgroup.

For any subgroup $H$ of $N_{U(\Lambda)}(R)$ we have the relations

$hR = Rh$ ($h \in H$) (23c)

so that $R$ and $H$ generate the subring

$HR = \langle R, H \rangle = RH$ (23d)

of $\Lambda$. We set ourselves the task of studying the structure of $HR$ in terms
of $H$ and $R$.

Analysis of the Problem 

Without loss of generality it may be assumed that

$U(R) \subseteq H \subseteq N_{U(\Lambda)}(R)$, (24a)

$HR = \Lambda = RH$. (24b)

We observe that the kernel of the homomorphism

$\varphi: H \to \mathrm{Aut}(R)$, $a(h\varphi) = h^{-1}ah$ ($a \in R$, $h \in H$) (24c)

is the centralizer of $R$ in $U(\Lambda)$:

$C_{U(\Lambda)}(R) = \{x \mid x \in U(\Lambda)\ \&\ \forall a(a \in R \Rightarrow xa = ax)\}$

and that the kernel of the corresponding homomorphism of $H$ into the
outer automorphism group of $R$

$\bar\varphi: H \to \mathrm{Out}(R)$, $h\bar\varphi = (h\varphi)\,\mathrm{Inn}(R)$ ($h \in H$) (24d)

is the product of the two normal subgroups $U(R)$, $C_{U(\Lambda)}(R) \cap H$. The
ring $R_1 = \langle R, C_{U(\Lambda)}(R) \rangle$ generated by $R$ and $C_{U(\Lambda)}(R)$ is the product of the
two elementwise commuting subrings $R$ and $R_0 = \langle C_{U(\Lambda)}(R) \rangle$ intersecting
in a subring of the center $\mathfrak{z}(R)$ of $R$. There hold the relations

$hR_1 = R_1h$

for any element $h$ of $H$ so that

$\langle R, H \rangle = HR_1 = R_1H = \sum_{h \in H} hR_1$.

Without loss of generality we can also assume that

$C_{U(\Lambda)}(R) \subseteq \mathfrak{z}(R)$, $\ker \bar\varphi = U(R)$. (24e)

Hence we have the exact sequences

$1 \to U(R) \xrightarrow{\iota_1} H \to S \to 1$,

$1 \to U(\mathfrak{z}(R)) \to H \xrightarrow{\varphi} \mathrm{Aut}(R)$ (24f)

and the conditions

$R^H = \{a \mid a \in R\ \&\ \forall h(h \in H \Rightarrow a = a(h\varphi))\} \subseteq \mathfrak{z}(R)$, $\iota_1\varphi = \theta$ (24g)

where $\iota_1$ is the natural embedding monomorphism of $U(R)$ in $H$, the second
arrow of the first sequence is the natural epimorphism of $H$ on the factor
group $S = H/U(R)$, and $\varphi$ is a monomorphism of $H$ in Aut(R).

Thus $H$ emerges as a special kind of group theoretic extension of the
unit group $U(R)$ characterized by (21), (24a)-(24c), (24e), and (24f) which
we will call an $R$-admissible extension of $U(R)$.

Our task is to describe the rings $\Lambda$ satisfying (24a) and (24b) in terms
of the $R$-admissible extension $H$ of $U(R)$.

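Definition The crossed product of a unital ring $R$ and an $R$-admissible
extension $H$ of the unit group $U(R)$ of $R$ is defined as the ring $R \times H$
with the generators $\bar a$ ($a \in R$), $\bar h$ ($h \in H$) and the defining relators

$\bar a_1 + \bar a_2 = \overline{a_1 + a_2}$, $\bar a_1\bar a_2 = \overline{a_1a_2}$ ($a_1, a_2 \in R$), (25a)

$\bar h_1\bar h_2 = \overline{h_1h_2}$ ($h_1, h_2 \in H$), (25b)

$\overline{x\iota_1} = \bar x$ ($x \in U(R)$), (25c)

$\bar h^{-1}\bar a\bar h = \overline{a(h\varphi)}$ ($a \in R$, $h \in H$). (25d)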

Theorem 2 (a) The mapping

$\iota_2: R \to R \times H$, $a\iota_2 = \bar a$ ($a \in R$) (25e)

is a monomorphism of the ring $R$ into the crossed product of $R$ and $H$
mapping the unit element of $R$ on the unit element of $R \times H$.

(b) The mapping

$\iota_3: H \to U(R \times H)$, $h\iota_3 = \bar h$ ($h \in H$) (25f)

is a monomorphism of $H$ into the unit group of the crossed product of $R$ and $H$
such that

$\iota_1\iota_3 = U(R)|\iota_2$, (25g)

$H\iota_3R\iota_2 = R \times H = R\iota_2H\iota_3$, (25h)

$N_{U(R \times H)}(R\iota_2)R\iota_2 = R \times H = R\iota_2N_{U(R \times H)}(R\iota_2)$, (25i)

$C_{U(R \times H)}(R\iota_2) \subseteq \mathfrak{z}(R)\iota_2$. (25j)

(c) Identifying $R$ with $R\iota_2$ via the embedding monomorphism $\iota_2$ and $H$
with $H\iota_3$ via the embedding monomorphism $\iota_3$ the crossed product of $R$ and $H$
appears as the universal solution of our task of which all other solutions are
obtained by the application of an $R$-epimorphism.

Proof Let T a transversal of H modulo U(R)i , such that 

H=T(U(R)i l ) = (U(R) h )T, 

T n T(xi 1 ) is empty whenever \ ^ xe U(R\ 

l«eT (26a) 

t't" = t(c(t',t")i 0 (t',t",teT- c(t', t") e U(R)). (26b) 

The set (R x H)* of all formal sums T t(ta) (a any mapping of Tin R 
with only a finite number of elements of T with nonzero image) with the 
operational rules 

£t(ta) = £ t(tb)oa = b, (26c) 

teT teT 

£ t(ta) + £ t(tb) = £ t(t(a + b)), (26d) 

teT teT teT 

( I'(fa)) I'('*>) = £t(t(a*b)), (26e) 

VsT I teT teT 


where the mapping a x b of T in R is defined by setting 

t(a xi)= X t"Wa(t"<PW'b)l 

t‘ 6 T, t" e T 
t't"V(R) = tV(R) 

is verified to be a ring by an easy computation. The mapping of a on 1 H a 
(aeU(R)) and of r(xii) on tx (teT, xeU(R)) preserves the defining 
relators of R x H as another easy computation shows. Moreover, the 
elements l H a, tx of above generate (R x H)*. Hence the mapping of above 
extends uniquely to an epimorphism e of R x H on (R x H)*. 

On the other hand for any solution A of our task the mapping of the 
formal sum «= T t ( ta ) on the corresponding sums defined in A preserves 
the operational rules (26d) and (26e) as a third computation will show. 
Thus it is shown that e is an isomorphism and that (26c) holds. 

The mapping of a on l H a (a e R) turns out to be a monomorphism 
of R in (R x H)*. Thus (a) is demonstrated. Note that 1 H a is the unit 
element of (R x H)*. The mapping of r(xi,) on tx (t e T, x e U(R)) turns 
out to be a monomorphism of H into -'Vcuk* //>*)( 1 // mapping xi x on l H x 
(x e U(R)). Thus (25b), (25h), and (25i) are demonstrated. 

Finally let us consider those elements $z = 1_Ha$ ($a \in R$) which commute
with all elements $tx$ ($t \in T$, $x \in U(R)$). Each element $h$ of $H$ is of the form
$h = t(x\iota_1)$ for some $t$ of $T$, $x$ of $U(R)$ and conversely; the commutativity
relation $z(tx) = (tx)z$ is tantamount to $z(h\varphi) = z$. Hence according to (24f)
we have $z \in \mathfrak{z}(R)$, demonstrating the remainder of Theorem 2.
Extension Invariance 

One of the criteria applied to any algebraic construction is the question 
how well it behaves in regard to an extension of the basic domain of 
rationality. 

Suppose the given unital ring R is embedded by means of an embedding 
monomorphism 

R-Q:i k (27a) 

into a unital ring Q such that the unit element of R is mapped by i R on the 
unit element of Q. 

Is it possible to embed a crossed product of R with an K-admissible 
extension H of U(R) into a crossed product of Q with some Q-admissible 
extension H n of U(Q) such that the embedding monomorphism 

R x H x H a : i RxH (27b) 

restricts to i R on R but to a monomorphism into H n on H and that the 
ring Q x H n is the ring generated by the subrings (R x H)i R xH , Q? Is it 
true that H n = H lRxff U( Q)? 

We observe that necessarily the homomorphism 

H q —► Aut(Q): cpn (27c) 

inherent in the construction of Q x H n restricts to a homomorphism 
H, R t H \<p a for which 

(ai R )(h(p n ) = oc(hcp) (ae R,he H), (27d) 

a condition which restricts our choice of Q. 

We consider only the case that Q is the tensor product of the ring R 
and a unital overring S of the central subring R H of R over R H : 

Q = R(x)S (27e) 

RH 

when it is required that 

1 s = 

and that the canonical embedding homomorphism (27a) mapping $a$ on $a \otimes 1_S$
($a \in R$) is a monomorphism. Without loss of generality the canonical
homomorphism

$\iota_S: S \to Q$, $s\iota_S = 1_R \otimes s$ ($s \in S$)

can be assumed to be monomorphic.

Now there is the canonical homomorphism (27c) given by setting

$(a \otimes s)(h\varphi_Q) = (a(h\varphi)) \otimes s$ (27f)

satisfying (27d).

The condition (27b) requires H n to be a homomorphic image of the 
amalgamated semidirect product of H and the unit group of Q over U(R\ 
say 

U(R) 

H □ 

<pCl 

Here quite generally we define the amalgamated semidirect product

\[ A \mathbin{\square}^{N} B \]

of two groups A, B over a common normal subgroup N as follows.

We assume N to be a normal subgroup of A with a given monomorphism
$\iota_N$ into B such that $N\iota_N$ is a normal subgroup of B. Furthermore, there is
given a homomorphism $\Psi$ of A into the automorphism group of B satisfying
the coherence condition

n0 A 'V = m B 0 B (neN) (27g) 

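where $\theta_A$, $\theta_B$ denote the canonical epimorphisms of A, B,
respectively, on the corresponding inner automorphism groups. Let

\[ A \mathbin{\square}^{N} B = \{ a \mathbin{\square} b \mid a \in A,\ b \in B \} \tag{27h} \]

with multiplication rule

\[ (a \mathbin{\square} b)(a' \mathbin{\square} b') = (aa') \mathbin{\square} \bigl(b(a'\Psi)\,b'\bigr) \tag{27i} \]

and equality rule

\[ a \mathbin{\square} b = a' \mathbin{\square} b' \iff a'^{-1}a \in N \ \&\ (a'^{-1}a)\iota_N = b'b^{-1} \quad (a, a' \in A;\ b, b' \in B). \]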

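One verifies easily that $A \mathbin{\square}^{N} B$ is a group with canonical embedding
monomorphisms

\[ A \to A \mathbin{\square}^{N} B : \iota_A, \qquad B \to A \mathbin{\square}^{N} B : \iota_B, \]
\[ a\iota_A = a \mathbin{\square} 1_B, \qquad b\iota_B = 1_A \mathbin{\square} b \]

satisfying the coherence condition

\[ N \mid \iota_A = \iota_N\iota_B, \tag{27k} \]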
moreover

\[ N\iota_A \trianglelefteq A \mathbin{\square}^{N} B, \qquad B\iota_B \trianglelefteq A \mathbin{\square}^{N} B, \tag{27l} \]
\[ (A\iota_A)(B\iota_B) = A \mathbin{\square}^{N} B, \qquad A\iota_A \cap B\iota_B = N\iota_A. \tag{27m} \]

A particular example is the semidirect product of A and B, arising whenever
N = 1. We denote it simply by $A \mathbin{\square}_{\Psi} B$.
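
For the case N = 1 the multiplication rule (27i) can be checked on a small example. The following Python sketch is an illustration added here, not part of the original text. It assumes A = Z/2 and B = Z/3 (both written additively) and lets the generator of A act on B by inversion; the names `psi`, `mul`, and `elements` are chosen for this example only. It checks that the rule yields a nonabelian group of order 6.

```python
from itertools import product

# Illustrative assumption: A = Z/2, B = Z/3, both written additively.
A = range(2)
B = range(3)

def psi(a, b):
    """Image of b under the automorphism of B attached to a:
    the generator of A inverts B, the identity of A acts trivially."""
    return b % 3 if a % 2 == 0 else (-b) % 3

def mul(x, y):
    """Rule (27i) with N = 1: (a, b)(a', b') = (aa', b(a'Psi) b')."""
    (a, b), (a2, b2) = x, y
    return ((a + a2) % 2, (psi(a2, b) + b2) % 3)

elements = list(product(A, B))
identity = (0, 0)

is_associative = all(
    mul(mul(x, y), z) == mul(x, mul(y, z))
    for x in elements for y in elements for z in elements
)
has_inverses = all(any(mul(x, y) == identity for y in elements) for x in elements)
is_abelian = all(mul(x, y) == mul(y, x) for x in elements for y in elements)
```

Because the automorphism of the second factor is applied to the left-hand element b before multiplying by b', the action appears on the right, in keeping with the convention of writing mappings to the right of their arguments.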

Continuing with the construction of an extension of the given crossed 
product R x H by means of the tensor product (27e) we let 

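\[ H_Q = H \mathbin{\square}^{U(R)}_{\varphi_Q} U(Q), \tag{27n} \]
\[ H_Q \to \operatorname{Aut}(Q) : \varphi_Q, \qquad (h \mathbin{\square} \varepsilon)\varphi_Q = (h\varphi_Q)(\varepsilon\theta_Q) \tag{27o} \]

($h \in H$, $\varepsilon \in U(Q)$, $\theta_Q$ the canonical epimorphism of U(Q) on Inn(Q)). It
follows that $H_Q$ is a Q-admissible extension of U(Q) such that the crossed
product $Q \times H_Q$ is an extension of $R \times H$ in the sense defined at the
beginning of this section. Let us observe the isomorphy relation

\[ Q \times H_Q \cong (R \times H) \otimes_{R^H} S \tag{27p} \]

with canonical isomorphism mapping the element $(a \otimes s)(h \mathbin{\square} \varepsilon)$ on
$(ah \otimes s)\varepsilon$ ($a \in R$, $s \in S$, $h \in H$, $\varepsilon \in U(Q)$).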

Examples 

(a) The group ring R[G] of a group G over a unital ring R is a crossed
product of the subring R[N] and G for any normal subgroup N of G that
contains its centralizer in G according to

\[ R[G] = R[N] \times G\,U(R[N]). \tag{28a} \]

(b) Suppose R is a unital ring with primitive idempotents $E_1, \dots, E_s$ of
the center of R such that $E_1 + \cdots + E_s = 1_R$ and H is an R-admissible
extension of U(R). Then the automorphism group $H\varphi$ permutes the finite
set $S = \{E_1, \dots, E_s\}$ such that S is partitioned into a finite number $S_1, \dots, S_\sigma$
of $H\varphi$-orbits. It follows that the idempotents

\[ E_{S_j} = \sum_{E_i \in S_j} E_i \quad (1 \le j \le \sigma) \]

are invariant under the automorphisms of R belonging to $H\varphi$ and that
the idempotents $E_{S_1}, \dots, E_{S_\sigma}$ are orthogonal idempotents of the center of the
crossed product $R \times H$ with the unit element as their sum, so that there
is the split

\[ R \times H = \sum_{j=1}^{\sigma} E_{S_j}(R \times H) \]

of $R \times H$ into the direct sum of $\sigma$ ideals of $R \times H$.

For each of those ideals let us form the centralizer $C_H(E_{S_j}R)$ of the
subring $E_{S_j}R$ of $E_{S_j}(R \times H)$ relative to H and a transversal $T_j$ of $C_H(E_{S_j}R)$
modulo the normal subgroup $C_H(E_{S_j}R) \cap U(R)$. It follows that the ring
$E_{S_j}(R \times H)$ contains the subring

\[ R_j = \sum_{t \in T_j} t\,E_{S_j}R \]

invariant under $H\varphi$ and that $E_{S_j}(R \times H)$ is a crossed product of $R_j$ and
$E_{S_j}H$ so that

\[ R \times H \cong \bigoplus_{j=1}^{\sigma} R_j \times E_{S_j}H. \tag{28b} \]

In this way the study of the crossed products for a ring R that is
isomorphic to the algebraic sum of a finite number of algebraically
indecomposable rings is reduced to the case that those indecomposable
components are transitively permuted by the group of automorphisms $H\varphi$.
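
The orbit decomposition used in Example (b) can be sketched computationally. The following Python fragment is an illustration under assumptions made here: the central idempotents are represented only by their indices, and the action of the automorphism group is given by a hypothetical list of generating permutations. It returns the orbits, that is, the index sets whose idempotent sums are invariant.

```python
def orbits(num_points, generators):
    """Partition range(num_points) into orbits of the group generated by
    the given permutations (each a list g with g[i] the image of i)."""
    seen = set()
    result = []
    for start in range(num_points):
        if start in seen:
            continue
        orbit = {start}
        frontier = [start]
        while frontier:
            point = frontier.pop()
            for g in generators:
                image = g[point]
                if image not in orbit:
                    orbit.add(image)
                    frontier.append(image)
        seen |= orbit
        result.append(sorted(orbit))
    return result

# Hypothetical example: five central idempotents, permuted by two generators.
gens = [[1, 0, 2, 3, 4], [0, 1, 3, 4, 2]]
idempotent_orbits = orbits(5, gens)
```

In this example the five idempotents fall into two orbits, so the crossed product would split into two ideals, each of which can be studied under the assumption of a transitive action.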

(c) Let us study the crossed product of a semisimple ring R with an
R-admissible extension H of U(R) such that the primitive idempotents
$E_1, \dots, E_\sigma$ of the center of R are transitively permuted by the automorphism
group $H\varphi$. According to the second structure theorem of Maclagan-Wedderburn
there is the decomposition

\[ R = \sum_{j=1}^{\sigma} R_j \quad \text{where } R_j = E_jR, \tag{29a} \]

of R into the direct sum of $\sigma$ simple ideals. By construction the simple
components of R are isomorphic to one another.

Let us analyze this situation! 

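For any simple ring $R_1$ there is the restriction homomorphism

\[ \operatorname{Aut}(R_1) \to \operatorname{Aut}(\mathfrak{z}R_1) : \rho_1, \qquad \alpha\rho_1 = \mathfrak{z}R_1 \mid \alpha \quad (\alpha \in \operatorname{Aut}(R_1)) \]

of the automorphism group of $R_1$ into the automorphism group of the center $\mathfrak{z}R_1$.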

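Each automorphism $\alpha$ of R permutes the primitive idempotents of the
center of R so that there is the permutation representation

\[ \operatorname{Aut} R \to \mathfrak{S}_\sigma : \Delta, \qquad E_j\alpha = E_{j(\alpha\Delta)} \quad (\alpha \in \operatorname{Aut} R). \tag{29b} \]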
In fact

\[ (\operatorname{Aut} R)\Delta = \mathfrak{S}_\sigma \tag{29c} \]

and Aut R is the semidirect product of the kernel of $\Delta$ and the subgroup
formed by the automorphisms

\[ R \to R : \pi, \qquad \Bigl(\sum_{i=1}^{\sigma} a_i\theta_i\Bigr)\pi = \sum_{i=1}^{\sigma} a_i\theta_{i\pi} \tag{29d} \]

($\pi \in \mathfrak{S}_\sigma$, $a_i \in R_1$, $\theta_i$ an isomorphism of $R_1$ on $R_i$ ($1 \le i \le \sigma$), $\theta_1 = 1_{R_1}$).

The kernel of $\Delta$ consists of the automorphisms of R that leave every
minimal ideal invariant. It is the direct product of the subgroups $(\operatorname{Aut} R)_i$
formed by the automorphisms of R fixing the complementary ideal of $R_i$
in R elementwise ($1 \le i \le \sigma$).

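There is the canonical monomorphism

\[ \operatorname{Aut} R_i \to \ker\Delta : \nu_i, \qquad \Bigl(\sum_{j=1}^{\sigma} a_j\Bigr)(\alpha\nu_i) = a_i\alpha + \sum_{\substack{j=1 \\ j \ne i}}^{\sigma} a_j \]
\[ (a_j \in R_j\ (1 \le j \le \sigma),\ \alpha \in \operatorname{Aut} R_i) \tag{29e} \]

sending $\operatorname{Aut} R_i$ on $(\operatorname{Aut} R)_i$ and, of course, there are the isomorphisms

\[ \operatorname{Aut} R_i = \theta_i^{-1}(\operatorname{Aut} R_1)\theta_i \tag{29f} \]

of $\operatorname{Aut} R_1$ and $\operatorname{Aut} R_i$ for $i = 1, 2, \dots, \sigma$.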

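In this way the automorphism group of R is seen to be isomorphic
to the wreath product of $\mathfrak{S}_\sigma$ over the automorphism group of $R_1$; the
outer automorphism group of R is isomorphic to the wreath product of
$\mathfrak{S}_\sigma$ over the subgroup $(\operatorname{Aut} R_1)\rho_1$ of $\operatorname{Aut}(\mathfrak{z}R_1)$:

\[ \operatorname{Aut} R \cong \operatorname{Aut} R_1 \wr \mathfrak{S}_\sigma, \tag{29g} \]
\[ \operatorname{Out} R \cong \operatorname{Out} R_1 \wr \mathfrak{S}_\sigma, \tag{29h} \]
\[ \operatorname{Aut} R_j \cong \operatorname{Aut} R_1 \quad (1 \le j \le \sigma), \tag{29i} \]
\[ \operatorname{Out} R_1 \cong (\operatorname{Aut} R_1)\rho_1. \tag{29j} \]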
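
As a small sanity check of (29c) and (29g), one may take R to be the direct sum of three copies of the field with two elements, so that the automorphism group of each simple component is trivial and the wreath product reduces to the symmetric group on three letters. The Python sketch below is an illustration added here and not taken from the original; it enumerates all ring automorphisms of this R by brute force and records the permutation each one induces on the primitive idempotents.

```python
from itertools import permutations, product

# Illustrative choice: R = F_2 + F_2 + F_2 with componentwise operations.
elements = list(product((0, 1), repeat=3))
zero, one = (0, 0, 0), (1, 1, 1)
idempotents = [(1, 0, 0), (0, 1, 0), (0, 0, 1)]  # the primitive idempotents

def add(x, y):
    return tuple((a + b) % 2 for a, b in zip(x, y))

def mul(x, y):
    return tuple(a * b for a, b in zip(x, y))

def is_automorphism(f):
    """f is a bijection given as a dict; check additivity and multiplicativity."""
    return all(
        f[add(x, y)] == add(f[x], f[y]) and f[mul(x, y)] == mul(f[x], f[y])
        for x in elements for y in elements
    )

# Any automorphism fixes 0 and 1, so only the other six elements are permuted.
others = [x for x in elements if x not in (zero, one)]
automorphisms = []
for images in permutations(others):
    f = dict(zip(others, images))
    f[zero], f[one] = zero, one
    if is_automorphism(f):
        automorphisms.append(f)

# Permutation of the indices of the primitive idempotents induced by each automorphism.
induced = {tuple(idempotents.index(f[e]) for e in idempotents) for f in automorphisms}
```

Here every permutation of the three idempotents is realized, and distinct automorphisms induce distinct permutations, as expected when the kernel of the permutation representation is trivial.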

The factor group of an R-admissible extension H of U(R) over U(R)
is isomorphic to a subgroup S of Aut R; in fact there is the exact sequence

\[ 1 \to U(R) \to H \to S \to 1. \tag{29k} \]

By assumption the permutation group $H\varphi\Delta$ of $\sigma$ letters is transitive.
Conversely, if S is a subgroup of Aut R such that $S\Delta$ is transitive
and if there is the exact sequence (29k) for H, we find the crossed product
$R \times H$ with the property that $1_R$ is the only idempotent of R invariant
under $H\varphi$.

When is the crossed product simple? In view of the conjugacy of the
orthogonal idempotents $E_i$ ($1 \le i \le \sigma$) under H the simplicity of $R \times H$
implies that the subalgebra $E_1(R \times H)E_1$ is simple.

If the element h of H does not belong to the H-centralizer $H_1$ of $E_1$,
then there holds an equation

\[ E_1h = hE_j \]

with $j > 1$ so that $E_1hE_1 = 0$. Hence

\[ E_1(R \times H)E_1 = H_1R_1 = R_1H_1. \tag{30} \]

Furthermore, the elements of $H_1$ commuting elementwise with $R_1$ form a
normal subgroup $H_1^*$ of $H_1$ such that

EiH* a tf i/Aut «!, 

U^RjAE^tEy, 

there is a transversal T x of E { H*Ey over U($Ry) such that 
EyH*yEy = TyU^ll T t O U^Ry) = Ey, 
xTy n U{iRy) = <t> (1 *xeU{zRy)), 

U(iRy)H*y = U(tRy)Ty, 

$T_1$ is linearly independent over $\mathfrak{z}R_1$ and $U(\mathfrak{z}R_1)T_1$ is the centralizer of
$R_1$ in $R_1H_1$.

The simplicity of $R_1H_1$ implies simplicity of the centralizer of the simple
subalgebra $R_1$. As can easily be shown, $T_1$ is finite. As the example
provided by any quaternion algebra over a field of characteristic $\ne 2$ shows,
it can happen that $T_1$ consists of more than one element.

Conversely, if $C_{R_1H_1}R_1$ is simple, then it follows from E. Artin's argument
on nonvanishing sums of the form

\[ \sum_{h \in X_1} (ha)\,h \]

($X_1$ a transversal of $H_1$ modulo the normal subgroup $N_1$ of all elements
n of $H_1$ for which $n\varphi$ belongs to Inn(R); a is a mapping of $X_1$ in $R_1$ with
all but a finite number of images equal to zero) which are contained in a
given ideal of $E_1(R \times H)E_1$ that the subring $E_1(R \times H)E_1$ is simple.

Now it follows from E. Artin’s argument applied once again in another 
context that R x H is simple. 
Theorem 3 (a) Any semisimple ring R is of the form

\[ R = \sum_{j=1}^{\sigma} E_jR \]

where $E_1, \dots, E_\sigma$ are the primitive idempotent elements of the center of R and
the numbering is chosen in such a way that there are isomorphisms

\[ E_{\sigma_k}R \to E_jR : \theta_j \]
\[ (\sigma_{k-1} < j \le \sigma_k,\ \theta_{\sigma_k} = 1_{E_{\sigma_k}R},\ 1 \le k \le \tau,\ \tau > 0,\ 0 = \sigma_0 < \sigma_1 < \sigma_2 < \cdots < \sigma_\tau = \sigma) \]

but that the simple algebras $E_{\sigma_k}R$ are nonisomorphic for $k = 1, 2, \dots, \tau$.

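(b) There is the permutation representation

\[ \operatorname{Aut}(R) \to \mathfrak{S}_\sigma : \Delta, \qquad E_j\alpha = E_{j(\alpha\Delta)} \quad (\alpha \in \operatorname{Aut}(R),\ 1 \le j \le \sigma) \]

with $\tau$ orbits $S_k = \{\sigma_{k-1} + 1, \dots, \sigma_k\}$ ($1 \le k \le \tau$).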

(c) The automorphism group of R is the direct product of the subgroups
$(\operatorname{Aut} R)_k$ formed by the automorphisms of R which leave the idempotent

\[ E_{S_k} = \sum_{j=\sigma_{k-1}+1}^{\sigma_k} E_j \]

as well as every element of the complementary ideal $(1_R - E_{S_k})R$ of $E_{S_k}R$ fixed
($1 \le k \le \tau$).

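(d) The direct component $(\operatorname{Aut} R)_k$ is isomorphic to the wreath product
of $\operatorname{Aut}(E_{\sigma_k}R)$ and $\mathfrak{S}_{\sigma_k - \sigma_{k-1}}$ ($1 \le k \le \tau$).

The outer automorphism group of $E_{S_k}R$ is isomorphic to the wreath product
of $\mathfrak{z}(E_{\sigma_k}R) \mid \operatorname{Aut}(E_{\sigma_k}R)$ and $\mathfrak{S}_{\sigma_k - \sigma_{k-1}}$ ($1 \le k \le \tau$).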

(e) The crossed product of R and an R-admissible extension H of U(R)
is simple precisely if $\tau = 1$, $H\varphi\Delta$ is transitive, and the centralizer of the
projective group algebra $C_H(E_1R)$ is simple.

It is a hypercomplex system over some field precisely if (e) is satisfied
and if $\mathfrak{z}(R)$ is of finite dimension over the field $C_R(H)$, i.e., $C_H(E_1R)$ is of
finite index in $C_H(E_1)$.

It is of the O. Teichmüller type precisely if (in addition) R is
commutative and $C_H(E_1) = C_H(E_1R)$.

It is of the classical A. Albert-R. Brauer-H. Hasse-E. Noether type
if (in addition) R is a field.

It is of the Dickson-Wedderburn type if (in addition) R is a finite cyclic
extension of $R^H$.
Corollary The automorphism group of a ring of matrices $R = R_0^{f \times f}$ over
the unital commutative ring $R_0$ is the semidirect product of the centralizer of the
matrix units in Aut(R) and the inner automorphism group. The first factor is
isomorphic to the automorphism group of $R_0$ with the canonical isomorphism
which maps the matrix $(a_{ik})$ on the matrix $(a_{ik}\alpha)$ for $\alpha$ in $\operatorname{Aut}(R_0)$.

5. Crossed Product Orders 

Let E be a semisimple ring and let H be an E-admissible extension of
U(E) such that the crossed product $A = E \times H$ is simple of finite dimension
over the center $F = E^H$.

In this section we study the arithmetic of certain orders of A over
dedekind rings R of F with F as quotient field which are related to the
presentation of A as a crossed product.

Definition The R-order $\Lambda$ of A is said to be a crossed product order
relative to the given presentation of A as crossed product over E if there
holds the decomposition

\[ \Lambda = \bigoplus_{i=1}^{H:U(E)} \Lambda \cap (H_iE) \tag{31a} \]

of $\Lambda$ into the direct sum of the intersections of $\Lambda$ with the crossed product
components of A, where

\[ H = \dot{\bigcup_{i=1}^{H:U(E)}} H_i, \qquad H_1 = U(E) \tag{31b} \]

is the coset decomposition of H modulo U(E) and where

\[ A = \bigoplus_{i=1}^{H:U(E)} H_iE, \tag{31c} \]
\[ (\Lambda \cap H_iE)(\Lambda \cap H_jE) = \Lambda \cap H_iH_jE \quad (i, j = 1, 2, \dots, H:U(E)). \tag{31d} \]

We observe that because of (31a) a crossed product order $\Lambda$ satisfies the
invariance condition

\[ N_H(\Lambda \cap U(E))\,U(E) = H. \tag{31e} \]

Theorem 4 (Benz-Zassenhaus) Let A be absolutely semisimple over F.
Every crossed product order $\Lambda$ is embedded into a crossed product order $\Lambda^{**}$
intersecting with E in a hereditary order such that the embedding is invariant
under all F-automorphisms of A.

The intersection of all maximal orders of A containing $\Lambda^{**}$ is a hereditary
order $\Lambda^*$ of A.

Proof "Hereditary orders" emerged in the theory of orders as an appropriate
generalization of "maximal orders", first conceived by Auslander and
Goldman and independently by H. Benz about twenty years ago. Of the
many equivalent definitions of hereditary orders of a semisimple hypercomplex
system B over F let us mention the following definitions.

(H1) The order $\Lambda$ of B is said to be hereditary if any $\Lambda$-sublattice $M_1$ of a
$\Lambda$-lattice M has a complement in M precisely if the $\Lambda$-factor module $M/M_1$
is a $\Lambda$-lattice (Auslander-Goldman).

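(H2) The order $\Lambda$ of B is hereditary precisely if for every prime ideal $\mathfrak{p}$
of R the $\mathfrak{p}$-radical of $\Lambda$, i.e., the ideal $J_\mathfrak{p}(\Lambda)$ of $\Lambda$ containing $\mathfrak{p}\Lambda$ with the
property that $J_\mathfrak{p}(\Lambda)/\mathfrak{p}\Lambda$ is the maximal nilpotent ideal of $\Lambda/\mathfrak{p}\Lambda$, is invertible
relative to $\Lambda$ (Auslander-Goldman).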

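(H3) The order $\Lambda$ of B is hereditary precisely if for any prime ideal $\mathfrak{p}$ of
R the left order of $J_\mathfrak{p}(\Lambda)$, i.e., the R-dedekind module

\[ [J_\mathfrak{p}(\Lambda)/J_\mathfrak{p}(\Lambda)] = \{ a \mid a \in B \ \&\ aJ_\mathfrak{p}(\Lambda) \subseteq J_\mathfrak{p}(\Lambda) \}, \]

coincides with $\Lambda$ (Zassenhaus).

(H4) The order $\Lambda$ of B is hereditary precisely if for any prime ideal $\mathfrak{p}$ of
R the right order of $J_\mathfrak{p}(\Lambda)$, i.e., the R-dedekind module

\[ [J_\mathfrak{p}(\Lambda)\backslash J_\mathfrak{p}(\Lambda)] = \{ b \mid b \in B \ \&\ J_\mathfrak{p}(\Lambda)b \subseteq J_\mathfrak{p}(\Lambda) \} \]

coincides with $\Lambda$ (Zassenhaus).

(H5) The order $\Lambda$ of B is hereditary precisely if for any prime ideal $\mathfrak{p}$ of
R the idealizer of $J_\mathfrak{p}(\Lambda)$, i.e., the R-dedekind module

\[ [J_\mathfrak{p}(\Lambda)/J_\mathfrak{p}(\Lambda)] \cap [J_\mathfrak{p}(\Lambda)\backslash J_\mathfrak{p}(\Lambda)], \]

coincides with $\Lambda$ (Zassenhaus).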

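(H6) The order $\Lambda$ of B is hereditary precisely if for any prime ideal $\mathfrak{p}$ of
R the $\mathfrak{p}$-localization of $\Lambda$, i.e., the $(R/(R - \mathfrak{p}))$-order

\[ \frac{\Lambda}{R - \mathfrak{p}} = \{ \lambda^{-1}a \mid a \in \Lambda \ \&\ \lambda \in R,\ \lambda \notin \mathfrak{p} \} \]

is hereditary over the $\mathfrak{p}$-localization

\[ \frac{R}{R - \mathfrak{p}} = \{ \lambda^{-1}r \mid r \in R \ \&\ \lambda \in R,\ \lambda \notin \mathfrak{p} \} \]

of R (Auslander-Goldman).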

(H7) The order $\Lambda$ of B is hereditary precisely if for any prime ideal $\mathfrak{p}$ of R
the $\mathfrak{p}$-adic completion of $\Lambda$, i.e., the $R_\mathfrak{p}$-order $\Lambda_\mathfrak{p}$ of all $\mathfrak{p}$-adic limits of
$\mathfrak{p}$-adic convergent sequences of elements of $\Lambda$, is a hereditary order over the
$\mathfrak{p}$-adic completion ring $R_\mathfrak{p}$ of R (Auslander-Goldman).

(H8) The order $\Lambda$ of B over R is hereditary precisely if the intersections
of $\Lambda$ with the simple components of B are hereditary over R and if $\Lambda$ is the
direct sum of its intersections with the simple components of B
(Auslander-Goldman).

(H9) The order $\Lambda$ of a commutative semisimple hypercomplex system
B over the quotient field F of a dedekind ring R is hereditary precisely if it is
the maximal order $\Lambda(B, R)$ of B over R (Auslander-Goldman).

(H10) The order $\Lambda$ of a division ring B of finite dimension over the
quotient field F of a complete local dedekind ring R is hereditary precisely if
it is the maximal order $\Lambda(B, R)$ of B over R (Auslander-Goldman).

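(H11) The order $\Lambda$ of B isomorphic to the ring of matrices $D^{f \times f}$ of a
division ring D of finite dimension over the quotient field F of a complete
local dedekind ring R is hereditary precisely if D and the $f^2$ matrix units $e_{ik}$
($i, k = 1, 2, \dots, f$) for which

\[ e_{\alpha\beta}e_{\alpha'\beta'} = \delta_{\beta\alpha'}e_{\alpha\beta'}, \qquad e_{\alpha\beta}\lambda = \lambda e_{\alpha\beta} \quad (\alpha, \beta, \alpha', \beta' = 1, 2, \dots, f;\ \lambda \in D), \]
\[ B = \sum_{\alpha=1}^{f}\sum_{\beta=1}^{f} De_{\alpha\beta}, \qquad 1_B = \sum_{\alpha=1}^{f} e_{\alpha\alpha}, \]

can be transformed by a suitable inner automorphism of B in such a way
that

\[ \Lambda = \Lambda(f_1, f_2, \dots, f_s; D, R) = \Bigl\{ \sum_{i=1}^{f}\sum_{k=1}^{f} \lambda_{ik}e_{ik} \Bigm| \lambda_{ik} \in \Lambda(D, R) \ \&\ \lambda_{ik} \in J(\Lambda(D, R)) \text{ if } 1 \le k \le \sum_{j=1}^{h} f_j < i \text{ for some } h \Bigr\} \]

($s > 0$; $f_i > 0$ ($1 \le i \le s$); $J(\Lambda(D, R))$ the Jacobson radical of
$\Lambda(D, R)$, which coincides with the only maximal ideal of $\Lambda(D, R)$)
(Auslander-Goldman).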

For the proof of the first part of Theorem 4 let us point out that every 






maximal order of A is hereditary. Also, because of the absolute semisimplicity
of A the discriminant ideal $\mathfrak{d}(\Lambda/R)$, i.e., the ideal of R generated by the
determinants $\det(\operatorname{tr}(a_ib_k))$ for all 2n-tuples $a_1, \dots, a_n, b_1, \dots, b_n$ of
elements of $\Lambda$ and the reduced system trace function tr, is not zero, and
consequently for all prime ideals $\mathfrak{p}$ of R not dividing $\mathfrak{d}(\Lambda/R)$ the
$\mathfrak{p}$-localization $\Lambda/(R - \mathfrak{p})$ is a maximal order of A over $R/(R - \mathfrak{p})$. The
discriminant ideal $\mathfrak{d}(\Lambda/R)$ is the power product of finitely many distinct
prime ideals $\mathfrak{p}_1, \dots, \mathfrak{p}_t$ of R.
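
For an order with a free basis over R the discriminant ideal is generated by the determinant of the trace form on that basis (a standard fact, assumed here rather than taken from the text). The Python sketch below is an added illustration: it computes this determinant for two Z-orders in the ring of 2 x 2 rational matrices, where the reduced trace is the ordinary matrix trace, namely the full matrix ring over Z and the order of matrices whose lower left entry is divisible by a prime p. The function names and the choice p = 5 are assumptions made for the example.

```python
from fractions import Fraction

def mat_mul(x, y):
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def trace(x):
    return sum(x[i][i] for i in range(len(x)))

def det(m):
    """Determinant by exact Gaussian elimination over the rationals."""
    m = [[Fraction(v) for v in row] for row in m]
    n, result = len(m), Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            result = -result
        result *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            m[r] = [a - factor * b for a, b in zip(m[r], m[col])]
    return result

def discriminant(basis):
    """Determinant of the trace form (tr(a_i a_k)) on a basis of the order."""
    return det([[trace(mat_mul(a, b)) for b in basis] for a in basis])

def unit(i, k, scale=1):
    """2 x 2 matrix with the single nonzero entry `scale` in position (i, k)."""
    e = [[0, 0], [0, 0]]
    e[i][k] = scale
    return e

p = 5
maximal_order = [unit(0, 0), unit(0, 1), unit(1, 0), unit(1, 1)]
hereditary_order = [unit(0, 0), unit(0, 1), unit(1, 0, p), unit(1, 1)]

disc_maximal = discriminant(maximal_order)
disc_hereditary = discriminant(hereditary_order)
```

The first value is a unit, in line with the full matrix ring being a maximal order, while the second is divisible by p only, so that only the localization at p needs to be examined.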

(H12) $\Lambda$ is hereditary precisely if the finitely many localizations
$\Lambda/(R - \mathfrak{p}_i)$ are hereditary over $R/(R - \mathfrak{p}_i)$ ($i = 1, 2, \dots, t$), where
$\mathfrak{p}_1, \dots, \mathfrak{p}_t$ run over the distinct prime ideal divisors of $\mathfrak{d}(\Lambda/R)$.

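Form $J_{\mathfrak{d}(\Lambda/R)}(\Lambda) = \bigcap_{i=1}^{t} J_{\mathfrak{p}_i}(\Lambda)$, and its $\Lambda$-idealizer $\Lambda^{(1)}$. It follows that
$\Lambda$ is hereditary precisely if $\Lambda = \Lambda^{(1)}$. If $\Lambda$ is not hereditary, then $\Lambda \subset \Lambda^{(1)}$.
Setting $\Lambda^{(i+1)} = (\Lambda^{(i)})^{(1)}$ for $i = 1, 2, \dots$ it follows that there is a nonnegative
integer $\mu(\Lambda)$ such that $\Lambda^{(\mu(\Lambda))} = \Lambda^*$ is hereditary, and moreover,

\[ \Lambda \subset \Lambda^{(1)} \subset \cdots \subset \Lambda^{(\mu(\Lambda))}. \]

It is clear that the formation of $\Lambda^*$ is invariant under the automorphisms of
A; $\Lambda^{**} = \bigoplus_i \Lambda^* \cap (H_iE)$ is a crossed product order of A, and $\Lambda^* \cap E$ is
hereditary.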

Thus the first part of Theorem 4 is established. 

For the proof of the second part, use (H8). We may assume without loss 
of generality that A is simple. 

Using (H6) and (H7) we may assume without loss of generality that the 
dedekind ring R is locally complete. 

If $\varphi$ is a finite unramified extension of the field F, then the integral
closure of R in $\varphi$ is a dedekind ring $R_\varphi$ with $\varphi$ as quotient field such that the
Jacobson radicals are related by

\[ J(R_\varphi) = J(R)R_\varphi. \]

For any R-order $\Omega$ the tensor product ring $\Omega \otimes_R R_\varphi$ is an $R_\varphi$-order with
Jacobson radical

\[ J(\Omega \otimes_R R_\varphi) = J(\Omega) \otimes R_\varphi. \]

In case $\Omega$ is hereditary then it follows from (H3) that $\Omega \otimes_R R_\varphi$ is hereditary,
and conversely.
Applying a suitable unramified finite field extension to the ground field,
it may be assumed without loss of generality that A is a ring of matrices of
finite degree n over F, that the center of E is the algebraic sum of $\sigma$
isomorphic fully ramified extensions of F, and that E itself is a ring of
matrices of degree m over its center. It follows from the corollary of Theorem
3 and from (H11), that we may make the following additional assumptions
without loss of generality:

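\[ E = \sum_{j=1}^{\sigma} e_jE, \qquad 0 \ne e_j = e_j^2 \in \mathfrak{z}(E), \]
\[ e_je_{j'} = 0 \quad (j \ne j';\ j, j' = 1, 2, \dots, \sigma), \qquad \sum_{j=1}^{\sigma} e_j = 1_E, \]
\[ e_j = \sum_{\alpha=1}^{m} e_{j\alpha\alpha} \quad (1 \le j \le \sigma), \]
\[ e_{j\alpha\beta}\,e_{k\alpha'\beta'} = \delta_{jk}\,\delta_{\beta\alpha'}\,e_{j\alpha\beta'} \quad (1 \le j \le \sigma,\ 1 \le k \le \sigma;\ \alpha, \beta, \alpha', \beta' = 1, 2, \dots, m), \]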

a m m 

*= e e r z 

fieHo j = 1 a=l 0=1 

ct m m 

j=l «=1 0=1 

5(a,/?) = 0 if < a < , m fl - l < fi < m, 

0 = m 0 < m x < * * • < m s = m, 

<5(a, /?) = 1 otherwise, 

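\[ H = H_0 \mathbin{\square} U(E), \qquad H_0 \text{ a finite group}, \]
\[ 1 \to H_0 \xrightarrow{\ \varphi_0\ } \operatorname{Aut}(E) \text{ exact}, \qquad H_0\varphi_0 \cap \operatorname{Inn}(E) = 1_E, \]

$H_0\varphi_0$ acting transitively on $e_1, \dots, e_\sigma$, the primitive idempotents of the
center of E, according to the operational rule

\[ e_j(h\varphi_0) = e_{j(h\Delta)} \quad (h \in H_0,\ 1 \le j \le \sigma), \]
\[ h^{-1}e_{j\alpha\beta}h = e_{j(h\Delta)\alpha\beta} \quad (h \in H_0,\ 1 \le j \le \sigma;\ \alpha, \beta = 1, 2, \dots, m), \]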

A = X' W(K) V( *>(A n £) (v(/j) g Z for h of H 0 ), 

he Ho 

\[ v(h'h'') = v(h') + v(h'') \quad (h', h'' \in H_0). \]

But since $H_0$ is finite and Z is torsion free, it follows that $v(h) = 0$ for all h
of $H_0$: if $h^k = 1$, then $k\,v(h) = v(1) = 0$. Hence

\[ \Lambda = \sum_{h \in H_0} h(\Lambda \cap E). \]

There are elements $h_i$ of $H_0$ satisfying

\[ h_i^{-1}e_1h_i = e_i \quad (1 \le i \le \sigma), \]

so that the elements

\[ e_{ik} = e_ih_i^{-1}e_1h_ke_k \quad (i, k = 1, 2, \dots, \sigma) \]

satisfy the rules for matrix units

\[ e_{ik}e_{i'k'} = \delta_{ki'}e_{ik'} \quad (i, k, i', k' = 1, 2, \dots, \sigma). \]

Furthermore,

\[ e_{jj} = e_j \quad (j = 1, 2, \dots, \sigma). \]

Hence 

As((An e^H) x H 01 U( A n e t 

where 

\[ H_{01} = \{ h \mid h \in H_0 \ \&\ e_1h = he_1 \} \]

is the $H_0$-stabilizer of $e_1$ acting on the matrix units. We may assume without
loss of generality that $\sigma = 1$ so that the order $\Lambda$ is isomorphic to the tensor
product ring over R of the hereditary order

Ai = E £ and = i( E ) * H o ■ 

«=1 0=1 

We apply the 

Lemma The tensor product ring of a hereditary order $\Omega_1$ over R with R
as center and of any hereditary R-order $\Omega_2$ with absolutely semisimple central
quotient ring is a hereditary order.

Proof Using (H8) and the invariance under finite unramified extension
of the field of reference it suffices to consider the case that

\[ F\Omega_1 = F^{m_1 \times m_1}, \qquad F\Omega_2 = (\mathfrak{z}F\Omega_2)^{m_2 \times m_2}. \]

Now the lemma follows from (H11) by explicit computation.

The application of the lemma reduces the proof of Theorem 4 to the case
that E is a separable normal finite purely ramified extension of F, and that

\[ H = H_0 \mathbin{\square} U(E), \qquad H_0\varphi_0 = \operatorname{Aut}(E/F), \qquad A = E \times H_0, \qquad A \cong F^{n \times n}. \]
In this case the integral closure $R_E$ of R in E is generated over R by an
Eisenstein element $\xi$ satisfying the equation

fit) = t n + t = 0 (J(R) = nR ; A, € R (1 < i < n); A„ = 1). 

i= 1 

where $f(t) = t^n + \sum_{i=1}^{n} \lambda_i t^{n-i}$ is an irreducible monic polynomial of R[t].

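Choosing suitable matrix units of A we may assume that $A = F^{n \times n}$,
$\Lambda \subseteq R^{n \times n}$, and that $\xi$ is the accompanying (companion) matrix of f, so
that $\Lambda$ is contained in the hereditary order $\bar\Lambda$ of all matrices $(\alpha_{ik})$

\[ (\alpha_{ik} \in R,\ \alpha_{ik} \in \pi R \text{ if } k < i\ (i, k = 1, 2, \dots, n)). \]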

Any maximal R-order of A containing $\Lambda$ is of the form $X^{-1}R^{n \times n}X$ with
nonsingular matrix X of $F^{n \times n}$ such that

\[ \Lambda \subseteq X^{-1}R^{n \times n}X, \qquad X\Lambda X^{-1} \subseteq R^{n \times n}. \]

But all irreducible R-representations of $\Lambda$ are R-equivalent to the natural
representation, so that there is also an element Y of GL(n, R) for which
$Y^{-1}\Lambda Y = X\Lambda X^{-1}$. Hence

yXA(YX) _1 =A YX e RA, 

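\[ YX = \varepsilon\xi^{\nu} \text{ with } \nu \in \mathbb{Z},\ \varepsilon \in U(\Lambda), \qquad X = Y^{-1}\varepsilon\xi^{\nu}, \]
\[ X^{-1}R^{n \times n}X = \xi^{-\nu}\varepsilon^{-1}YR^{n \times n}Y^{-1}\varepsilon\xi^{\nu} = \xi^{-\nu}R^{n \times n}\xi^{\nu} \supseteq \xi^{-\nu}\bar\Lambda\xi^{\nu}. \]

But by construction we have

\[ \xi^{-1}\bar\Lambda\xi = \bar\Lambda \]

so that indeed

\[ X^{-1}R^{n \times n}X \supseteq \bar\Lambda. \]

Consequently, $\bar\Lambda$ is contained in the intersection of all maximal orders
containing $\Lambda$. On the other hand, we know from Auslander-Goldman that every
hereditary order is the intersection of the maximal orders to which it
belongs. Therefore, the intersection of all maximal R-orders of A containing $\Lambda$
is the hereditary order $\bar\Lambda$. Thereby Theorem 4 is established.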
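
The key step of the argument, that conjugation by the companion matrix of an Eisenstein polynomial maps the order of matrices with entries below the diagonal divisible by the prime element into itself, can be checked numerically. The Python sketch below is an added illustration under assumptions made here: R is replaced by the integers, the prime is 3, and the polynomial is t^3 + 3t^2 + 3t + 3. The helper names are chosen for this example. Since conjugation is linear, it suffices to test a basis of the order.

```python
from fractions import Fraction

def mat_mul(x, y):
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def mat_inv(x):
    """Inverse by exact Gauss-Jordan elimination over the rationals."""
    n = len(x)
    aug = [[Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(x)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        lead = aug[col][col]
        aug[col] = [v / lead for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def companion(coeffs):
    """Companion matrix of t^n + c_1 t^(n-1) + ... + c_n, with ones on the
    superdiagonal and last row (-c_n, ..., -c_1)."""
    n = len(coeffs)
    rows = [[int(k == i + 1) for k in range(n)] for i in range(n - 1)]
    rows.append([-c for c in reversed(coeffs)])
    return rows

def in_order(m, p):
    """Integral entries, and entries below the diagonal divisible by p."""
    for i, row in enumerate(m):
        for k, v in enumerate(row):
            v = Fraction(v)
            if v.denominator != 1 or (k < i and v.numerator % p != 0):
                return False
    return True

p, n = 3, 3
xi = companion([3, 3, 3])      # t^3 + 3t^2 + 3t + 3 is Eisenstein at 3
xi_inv = mat_inv(xi)

# Z-basis of the order: matrix units, scaled by p below the diagonal.
basis = []
for i in range(n):
    for k in range(n):
        e = [[0] * n for _ in range(n)]
        e[i][k] = p if k < i else 1
        basis.append(e)

conjugates_in_order = all(in_order(mat_mul(mat_mul(xi_inv, e), xi), p) for e in basis)
```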
Applications

I. Let G be a finite group containing a cyclic normal subgroup N that is
its own centralizer in G and let $\Delta$ be a faithful irreducible rational integral
representation of degree f.

It follows that the rational linear combinations of the matrices $g\Delta$
($g \in G$) form a simple hypercomplex system $QG\Delta$ with Z-order $ZG\Delta$ that is
the crossed product $ZN\Delta \times G\,U(ZN\Delta)$. Hence the intersection of all maximal
Z-orders of $QG\Delta$ containing $ZG\Delta$ is a hereditary order $\overline{ZG\Delta}$. The integral
representations of G that are arithmetically equivalent to $\Delta$ fall in as
many arithmetical equivalence classes as the irreducible integral representations
of the hereditary order $\overline{ZG\Delta}$ from which they are derived by restriction
to G.

The locally integral representations of local hereditary orders can be 
determined explicitly. 

The integral representations of global hereditary orders can be derived
from their localizations and a generalization of a famous theorem of E.
Steinitz.

II. Let A be a simple hypercomplex system over the rational number
field Q. Let E be a maximal commutative subfield of A that contains the center
of A as subfield such that E is a normal extension of the center of A.

Then A is a crossed product of E and $N_{U(A)}(E)$. There are corresponding
crossed product Z-orders.

The intersection of all maximal Z-orders of A containing a crossed product
order is a hereditary order.

Note that the crossed product orders fall into a finite number of equivalence
classes under transformations by nonzero elements of E.

6. Valuation Theoretic Interpretation 

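In pursuit of H. Benz's ideas we give here a valuation theoretic interpretation
of Theorem 4. Given a dedekind ring R with quotient field F, we
associate with every prime ideal $\mathfrak{p}$ of R a multiplicative valuation $\varphi = \varphi_\mathfrak{p}$

\[ F \to \mathbb{R}^{\ge 0} : \varphi_\mathfrak{p} \]

characterized by the rules

\[ x\varphi > 0 \iff x \ne 0 \quad (x \in F), \tag{32a} \]
\[ (xy)\varphi = (x\varphi)(y\varphi) \quad (x, y \in F), \tag{32b} \]
\[ (x + y)\varphi \le \max(x\varphi, y\varphi) \quad (x, y \in F), \tag{32c} \]
\[ x\varphi \le 1 \text{ if } x \in R, \tag{32d} \]
\[ x\varphi < 1 \text{ if } x \in \mathfrak{p}. \tag{32e} \]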
It follows that $\varphi_\mathfrak{p}$ is discrete such that

\[ \max_{x \in \mathfrak{p}} x\varphi_\mathfrak{p} = \gamma_\mathfrak{p} = \gamma < 1. \tag{32f} \]
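
For the rational number field and a prime number p these rules can be tested directly. The Python sketch below is an added illustration: it assumes R = Z, F = Q, and the normalization in which the largest value on the prime ideal is 1/p; the function names `ord_p` and `phi` are chosen for the example. It checks multiplicativity (32b) and the ultrametric inequality (32c) on a small sample of rational numbers.

```python
from fractions import Fraction

def ord_p(x, p):
    """Exponent of the prime p in the nonzero rational number x."""
    x = Fraction(x)
    if x == 0:
        raise ValueError("ord_p is undefined at 0")
    num, den, exponent = x.numerator, x.denominator, 0
    while num % p == 0:
        num //= p
        exponent += 1
    while den % p == 0:
        den //= p
        exponent -= 1
    return exponent

def phi(x, p):
    """Multiplicative p-adic valuation with value 1/p at p (illustrative normalization)."""
    x = Fraction(x)
    return Fraction(0) if x == 0 else Fraction(1, p) ** ord_p(x, p)

p = 5
samples = [Fraction(a, b) for a in range(-6, 7) for b in (1, 2, 5, 25)]
multiplicative = all(phi(x * y, p) == phi(x, p) * phi(y, p) for x in samples for y in samples)
ultrametric = all(phi(x + y, p) <= max(phi(x, p), phi(y, p)) for x in samples for y in samples)
```

With this normalization every integer has value at most 1 and every multiple of p has value at most 1/p, which corresponds to (32d), (32e), and (32f).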

Given a semisimple hypercomplex system A over F we study the Kürschák
valuations $\Phi$ of A which are characterized as those mappings $\Phi$ of A in
$\mathbb{R}^{\ge 0}$ for which

\[ a\Phi > 0 \iff a \ne 0, \tag{33a} \]
\[ (ab)\Phi \le (a\Phi)(b\Phi) \quad (a, b \in A), \tag{33b} \]
\[ (a + b)\Phi \le \max(a\Phi, b\Phi) \quad (a, b \in A), \tag{33c} \]

and among them in particular the Kürschák extensions of $\varphi_\mathfrak{p}$ characterized
by the additional condition

\[ F \mid \Phi = \varphi_\mathfrak{p}, \tag{33d} \]

as a consequence of which we have $\varphi_\mathfrak{p}$-multiplicativity

\[ (xa)\Phi = (ax)\Phi = (a\Phi)(x\Phi) \quad (a \in A,\ x \in F). \tag{33e} \]


The trivial case R = F usually is not considered. Furthermore, upon
proceeding to the $\varphi$-completion, we may assume without loss of generality
that R is local with $\mathfrak{p}$ as its only maximal ideal and complete discrete
multiplicative valuation $\varphi = \varphi_\mathfrak{p}$. Denoting by

\[ \mathfrak{M}(\Phi, \rho) = \{ a \mid a \in A \ \&\ a\Phi \le \rho \} \tag{33f} \]

the submodule of A formed by all elements of A for which the value of the
Kürschák valuation $\Phi$ is not larger than $\rho$ ($0 < \rho \in \mathbb{R}$) we find that the
Kürschák valuation $\Phi$ of A is an extension of $\varphi$ precisely if

\[ \mathfrak{p}^{\nu}\mathfrak{M}(\Phi, \rho) = \mathfrak{M}(\Phi, \gamma^{\nu}\rho) \quad (\nu \in \mathbb{Z},\ \rho \in \mathbb{R}^{>0}). \tag{33g} \]

We note that the multiplicativity

\[ \mathfrak{M}(\varphi, \rho_1)\mathfrak{M}(\varphi, \rho_2) = \mathfrak{M}(\varphi, \rho_1\rho_2) \quad (\rho_1, \rho_2 \in \mathbb{R}^{>0}) \tag{33h} \]

is a consequence of (32b), whereas for Kürschák valuations $\Phi$ of A in general
only the weaker condition

\[ \mathfrak{M}(\Phi, \rho_1)\mathfrak{M}(\Phi, \rho_2) \subseteq \mathfrak{M}(\Phi, \rho_1\rho_2) \quad (\rho_1, \rho_2 \in \mathbb{R}^{>0}) \tag{33i} \]

holds that is tantamount to (33b). It implies that $\mathfrak{M}(\Phi, 1)$ is an R-order of A.

Definition The Kürschák extension $\Phi$ of $\varphi$ to a Kürschák valuation of
A is said to be hereditary if it satisfies the conditions

\[ \mathfrak{M}(\Phi, \rho_1)\mathfrak{M}(\Phi, \rho_2) = \mathfrak{M}(\Phi, \rho_1\rho_2) \quad (\rho_1, \rho_2 \in \mathbb{R}^{>0}), \tag{33j} \]
\[ \forall a \Bigl( a \in A \ \&\ a\bigcup_{\rho < 1}\mathfrak{M}(\Phi, \rho) \subseteq \bigcup_{\rho < 1}\mathfrak{M}(\Phi, \rho) \implies a\Phi \le 1 \Bigr). \tag{33k} \]

As a consequence of these conditions and of (H3) the R-order
$\Lambda = \mathfrak{M}(\Phi, 1)$ of A is hereditary such that

\[ J(\Lambda)^{e_\Phi} = \mathfrak{p}\Lambda \quad (e_\Phi \in \mathbb{Z}^{>0}), \tag{33l} \]
\[ a\Phi = \gamma^{\nu/e_\Phi} \text{ if } a \in J(\Lambda)^{\nu},\ a \notin J(\Lambda)^{\nu+1} \quad (0 \ne a \in A) \tag{33m} \]

and conversely (33l) implies that (33m) defines a hereditary Kürschák extension
$\Phi$ of $\varphi$.

Here the number $e_\Phi$ is called the ramification index of $\Phi$ over $\varphi$ because the
$\Phi$-values $\ne 0$ form the cyclic group generated by $\gamma^{1/e_\Phi}$ which contains the
value group of $\varphi$ as a subgroup of index $e_\Phi$.

Theorem 4 admits the following valuation theoretic interpretation: 

Let A be a simple ring of finite dimension over its center F with complete
discrete multiplicative valuation $\varphi$ such that A is crossed product of some
semisimple subalgebra E.

Then there is a hereditary Kürschák extension $\Phi_E$ of $\varphi$ that is unique up
to inner automorphism of E.

For given $\Phi_E$ there is a Kürschák extension $\Phi_A$ of $\varphi$ (Kürschák extension of
$\Phi_E$) which is unique up to inner transformation by the units of the center of
E.